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SEMIPARAMETRIC ESTIMATION FOR STATIONARY PROCESSES 
WHOSE SPECTRA HAVE AN UNKNOWN POLE 1 

By Javier Hidalgo 

London School of Economics 

We consider the estimation of the location of the pole and mem- 
ory parameter, A and a, respectively, of covariance stationary lin- 
ear processes whose spectral density function /(A) satisfies /(A) ~ 
C|A — A°|~ Q in a neighborhood of A . We define a consistent estimator 
of A and derive its limit distribution Z x a . As in related optimization 
problems, when the true parameter value can lie on the boundary of 
the parameter space, we show that Z x o is distributed as a normal 
random variable when A £ (0, 7r), whereas for A = or it, Z x o is 
a mixture of discrete and continuous random variables with weights 
equal to 1/2. More specifically, when A = 0, Z x o is distributed as 
a normal random variable truncated at zero. Moreover, we describe 
and examine a two-step estimator of the memory parameter a, show- 
ing that neither its limit distribution nor its rate of convergence is 
affected by the estimation of A . Thus, we reinforce and extend previ- 
ous results with respect to the estimation of a when A is assumed to 
be known a priori. A small Monte Carlo study is included to illustrate 
the finite sample performance of our estimators. 

1. Introduction. Given a covariance stationary process {xt} observed at 
times t = 1, 2, . . . , n, the search for cyclical components and their estimation 
and testing are of undoubted interest. This is motivated by the observed 
periodic behavior exhibited in many time series and manifested by sharp 
peaks of the spectral density estimate. 

A well-known model capable of generating such a periodic behavior is the 
regression model 

(1.1) x t = n + pi cos(A°t) + p2 sin(A°t) + e t , 
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where p\ and pi are zero-mean uncorrelated random variables with the 
same variance and {ej} is a stationary sequence of random variables in- 
dependent of p\ and pi. Model (1.1) has enjoyed extensive use and different 
techniques have been proposed for the estimation of the frequency, ampli- 
tude and phase (see [6, 18, 19, 20, 35]). Extensions to a model with more 
than one periodic component have been examined by Quinn [29] and Kava- 
lieris and Hannan [25], whose interest was also in testing the number of 
sinusoidal/cosinusoidal components. See also [30]. 

A second statistical model capable of exhibiting peaks in its spectral den- 
sity function is the autoregressive AR(2) process 

(1.2) (l-a 1 L-a 2 L 2 )x t = £ t 

when the zeros of the polynomial (1 — a\L — CL2L 2 ) are complex, with A 
identified as arccos(7j^=). Models (1.1) and (1.2) represent two extreme 
situations explaining cyclical behavior of the data and the peakedness of 
the spectral density function. Model (1.2) possesses a continuous spectral 
density function whereas model (1.1) has a spectral distribution function 
with a jump at the frequency A . The cyclical component of the data remains 
constant or invariant with time in model (1.1), whereas the cyclical pattern 
of model (1.2) fades out with time fairly quickly. 

Between these two extreme situations there exists a class of intermediate 
models in which the spectral density function of x% exhibits a pole at the 
frequency A . For that purpose, define the spectral density function of xt as 
the function /(A) which satisfies the relationship 



(1.3) 1 (j) = Cov(x t ,x t+J ) = /(A)cos(jA)dA, j = 0, 1, 2, . . . . 

We say that /(A) has a pole at A if 

(1.4) /(A)~C Y |A-A r° asA^A , 

where C € (0, 00), a £ (0, 1) is the memory parameter and "~" means that 
the ratio of the left- and right-hand sides tends to 1. One of the main ob- 
jectives of this paper is the estimation of A . 

One model capable of generating such a cyclical behavior in the data has 
been proposed by Andel [2] and Gray, Zhang and Woodward [17] and defined 
as 

(1.5) (l-2(cosA°)L + L 2 ) d x i = e t , 

where L is the backshift operator, d = a/2 for A € (0, 7r), whereas for A = 
or 7r, d = a/A. The model (1.5) was coined the Gegenbauer model by Gray, 
Zhang and Woodward [17], who extended it to the GARMA model where the 
innovations {et} follow an autoregressive moving average (ARM A) process, 
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and it was later extended by Giraitis and Leipus [13] allowing for more than 
one pole or cyclical component. The GARMA process is characterized by 
having the spectral density function 



where a 2 > 0, and a(-) and &(■) are polynomials of finite degree, all of whose 
zeroes lie outside the unit circle. When A = 0, we have the more famil- 
iar FARIMA model, apparently originated by Adenstedt [1], and studied 
by Granger and Joyeux [16] and Hosking [23]. GARMA models are char- 
acterized by a stronger and more persistent cyclical behavior than ARMA 
models, that is, (1.2), but unlike model (1.1), their amplitude does not re- 
main constant over time. 

If the location of the pole A is known, then under some regularity con- 
ditions and a correct specification of the model, Whittle estimates of the 
parameters a (or d), 6 and a 2 , for example model (1.6), are known to be 
n 1 / 2 -consistent and asymptotically normal. In the case of Gaussianity or 
linear processes, this was shown by Fox and Taqqu [11], Dahlhaus [7] and 
Giraitis and Surgailis [15] when A = and generalized by Giraitis and Lei- 
pus [13] and Hosoya [24] for A different from 0. All these papers assume 
that /(A) is fully specified by a set of parameters (a, 9', a 2 )'. 

Although knowledge of A can be realistic in some time series data, with 
nonseasonal data that knowledge of A is not so clear. An example of the 
latter is when the practitioner is interested in estimating cycles in macroe- 
conomic or geophysical data. Recently, Giraitis, Hidalgo and Robinson [12] 
have shown that Whittle estimates of (a, 9' , a 2 )' are asymptotically the same 
irrespective of whether or not A is known. In addition, they proved that 
the estimate of A is n-consistent although its limit distribution remains an 
open problem. 

However, if the ultimate interest is only the estimation of the memory 
parameter a, one possible criticism of the parametric approach is that an 
incorrect specification of the model leads to inconsistent estimates of a. One 
source of misspecification is the choice of a wrong value of A . If that were 
the case, Whittle estimates of a would be inconsistent, and would possibly 
estimate the value zero. The latter might happen even if a semiparametric 
approach were adopted; see Section 3. Thus, we might conclude that the 
data is short-memory- instead of long-memory-dependent, which could have 
some adverse effects on the statistical inference of relevant statistics such as 
the serial covariances; see [21] or [33]. 

The main objectives of this paper are twofold: first, under mild conditions, 
to provide a consistent estimator of A and characterize its limit distribution. 
In particular we show that the limiting distribution of the estimator of A 



(1.6) /(A) = — |l-2(cosA°)e iA + e 



1 b{e lX ;9) 



TV < A < 7T 
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depends on whether A £ (0, 7r) or A = or ir. The second objective is to 
investigate the consequences that the lack of knowledge of A might have on 
the estimation of a. 

Some earlier related work has been completed by Miiller and Prewitt [27] 
and Yajima [36]. In the former the authors estimate the peak of the spectral 
density /(A) in a model, like that in (1.2), whose spectral density function 
is continuous in [0,7r]. Looking at argsup A /(A), where /(A) is a smoothed 
nonparametric estimate of /(A), they show its consistency and the limit 
distribution to be a normal random variable when A G (0,7r). Yajima [36] 
considers the estimation of A in a model with spectral density function 
possessing a pole at A . Based on the maximum of the periodogram of the 
data, he gives consistency and an upper rate of convergence for the estimate 
of A . Unfortunately, the limit distribution, which is required for statistical 
inference, was not provided. In addition, his results rely on the assumption 
that the data is Gaussian, which is not required in the present paper. Finally, 
it should be mentioned that Giraitis and Leipus [13] prove the consistency 
of A in a model like (1.6). 

The paper is organized as follows. In Section 2 we describe a semipara- 
metric estimator of a when A and the estimator A of A are known. In 
Section 3 we discuss the statistical properties of A and we show that the 
asymptotic properties of a two-step estimator of a remain the same irre- 
spective of whether A is known or estimated. The finite sample behavior of 
the estimators of A and a is analyzed in Section 4 through a Monte Carlo 
study. Section 5 provides the proofs of the results given in Section 3, which 
apply a series of lemmas given in Section 6. Finally, Section 7 contains a 
summary. 

2. Regularity conditions and the estimators of the pole and memory pa- 
rameter. Let {xt} be a covariance stationary linear process observed at 
times t = 1, 2, . . . , n, with spectral density /(A) satisfying (1.4). When A is 
known, under the semiparametric specification (1.4) several estimators of 
the memory parameter a have been proposed and examined. In this paper, 
to estimate a we shall use a modification of the log-periodogram estima- 
tor (see [31]), which we now describe. Consider the average periodogram 
spectral density estimator of /(A), 



(2.1) 




where Ig 



I(\e) denotes the periodogram of xt, that is, 



n 



2 






t=l 
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and Iq = 0, where Xe = (27r£)/n for £ = 1, 2, . . . , [n/2], [z] denotes the integer 
part of z and k\ = k\(n) is a positive number which increases slowly with n, 
that is, k^ 1 + n _1 fci — > 0. (Observe that the definition of 1^ entails sample- 
mean correction.) 

Let be a weight function in (0, 1), and write 

1 k 

(2-3) a{\ q ) = -=-— ^ V P (log /g+p + log /,_„), 

where ^ p = fp{p/k), = -k~ l J2p=i tpp^og{p/k), f e = max(/ f) n" 1 ) and fe = 
fc(n), a positive number which increases slowly with n, that is, A; -1 + n~ 1 k — > 
0. 

Definition 2.1. If A is known, we define the estimator of the memory 
parameter a as a(X s ), where A s is the closest Fourier frequency X q to A . 

Remark 2.1. The motivation to use fe instead of fe in (2.3) is due to 
the singular behavior of logx at x = 0. Specifically for the proof of tightness, 
that is, Proposition 5.4 in Section 5, we have not been able to bound some 
probabilities or moments for all n > no as required. This problem, of course, 
does not appear as n — > oo as can be observed from Propositions 5.1-5.3, nor 
if our goal were to examine the behavior of a(X s ). We do not believe that 
this adjustment is needed in practice and have made it here only because we 
cannot establish Theorem 3.2 (cf. Proposition 5.4) without it, unless some 
additional stronger conditions were introduced, for instance, the normality 
of the data. 

We now define our estimator of A as A = A^ = (2irq)/n, where 

(2.4) q= argmax a(X q ). 

g =o,..., [n/2] 

Note that periodicity and symmetry around zero imply that it suffices to 
search for the maximum in (2.4) at frequencies X q , with q = 0, . . . , [n/2]. 
From (2.4) we could define a(X ) as an estimator of a, that is, (2.3) evaluated 
at A . However (see Section 3), since a(X ) does not have optimal properties, 
we will describe a two-step estimator, denoted d(A°), which overcomes all 
the adverse properties of d(A°); see Theorem 3.4. 

The motivation for the estimator in (2.4) is as follows. From the proof of 
Theorem 3.4(a) below, it is easily shown that a(X s ) is a consistent estimator 
of a. On the other hand, if X q is in any open set outside A , that is, |A g — A s | > 
S, for any arbitrarily small 5 > 0, Condition C.l below implies that 

f(X q ) = \X q -X°\~ a g(X q )~Cg(X q ). 
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That is, at the frequency X q the spectral density function behaves as if a 
were equal to zero. So, from the proof of Theorem 3.4(a) we should expect 

P A n 

that &(Xq) — > 0, implying that Pr{|A — A s | < 5} — > 1. That is, the estimator 
given in (2.4) is consistent. These heuristics will be formalized in Theo- 
rem 3.1 below. We finish this section by introducing the following regularity 
conditions and their discussion. 

Condition C.l. There exists a e (0, 1) such that 

fm = /l A - A °l^ A )' ifO<A<vr, 
J{ ' \\X + X\ a g{\), if-vr<A<0, 

where g(X) is a bounded symmetric and bounded away from zero function 
with two continuous derivatives for < A < it. 

Condition C.2. {%t} is a covariance stationary linear process, 

oo oo 

X t = (ijEt-j , $ < °°' 

3=0 j=0 

where {et} is a zero-mean i.i.d. sequence with E(e 2 ) = 1 and E\et\ e = in < oo 
for I = 3, . . . , 2r and some r > 4. 

Condition C.3. As A — > A, the function /3(A) = YljLoftje 1 ^ satisfies 
\d(3(X)/dX\=O(\X-X \- 1 \P(X)\). 

Condition C.4. k 1+L k^ 2 + k~ 2 k\ logfc + n/c^ (r2+2)/2T -> 0, for some l > 
as n — > oo, with k < ere 4 / 5 , < c < oo and where r is as in Condition C.2. 

Condition C.5. The function ip(x) is twice continuously differentiate 
with second derivative that is Lipschitz of order at least ^ in its sup- 
port (0,1) and satisfies jQip(x)dx = 0, < = — Jq i/>(x)(logx) dx < oo, 
< ■0" = Jq 1 ip"(x)(logx) dx, where ip"(x) = -£^ip(x), and \x~ 2 tp(x)\ + |(1 - 
xy 1 ip(x)\ <D<oo. 

We now discuss Conditions C.1-C.5. Condition C.l is much the same as 
that employed by Robinson [31, 32]. Indeed, Condition C.l implies that as 
A^A°, 

/(A) = C\X - A°T a (l + C-y(A°)(A - A ) + 0(|A - A°| 2 )) 

by Taylor expansion of g(X) around A and where C = g{X°). Observe that 
g'(X°) = when A = {0,7r} by symmetry of /(A), obtaining then the corre- 
sponding condition used in [31, 32]. However, we prefer to state the condition 
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in its present form since, in Theorems 3.1 and 3.2 below, some regularity con- 
ditions on /(A) are needed outside any open set containing ±A . Examples 
of processes whose spectral density function satisfies Condition C.l are the 
FARLMA(p,a/2,<?) and the GARMA model given in (1.6). Finally, the last 
part of Condition C.l is quite standard in the spectral density estimation 
literature. Condition C.2 is needed for the proof of tightness (see the proofs 
of Theorems 3.2 and 3.1). It is also required to show the uniform convergence 
of /, although for the latter property, at the expense of stronger conditions 
on the rate of convergence of k^ 1 to zero, fewer moments of e% can be as- 
sumed. Obviously Condition C.2 is satisfied if Et is Gaussian. Condition C.3 
is the same as Robinson's [32]. Condition C.4 controls the rate of increase of 
k and k\. For instance, denoting k = n 12 and k\ = n 71 , in the Gaussian case, 
we can take < 71 < 8/15 and 371/2 < 72 < min{27i/(l + t),4/5}, whereas 
for r = 4 the bounds are 4/9 < 71 < 8/15 and 371 /2 < 72 < 4/5. Finally, 
Condition C.5 characterizes the type of weight in (2.3). An example of i^{u) 
satisfying Condition C.5 is ip(u) = —u 2 + 35u 2 ' 5 /6 — 29u 3 /6 + 2u 3 logti. 

It is worth mentioning that the quadratic behavior of the weight i/j(u), 
as u — ► 0, guarantees that the first moment of £(v) (see Theorem 3.2 below) 
has a parabolic structure so that the maximum of £(v) is easily obtainable. 
Obviously, other different types of weights can be used which would not 
prevent the consistency of the estimator of A . However, for weights not 
having a quadratic behavior, the asymptotic distribution of the estimate of 
the pole is not guaranteed to be normally distributed. We will return to this 
condition after Theorem 3.2. 

3. Statistical properties of the estimators of the pole and memory pa- 
rameter. In this section we prove a functional limit theorem for a process 
operating on increments of a(X q ) near A , which together with the con- 
tinuous mapping theorem will allow obtaining the asymptotic distribution 
of A . A similar approach was used by Eddy [10] to estimate the mode of 
a probability density function and by Miiller [26] for the estimation of the 
break point in a regression model. Apart from providing the consistency and 
rate of convergence of A to A , the limit distribution will guarantee that 
asymptotic valid inferences around the true value of A may be implemented. 

The strategy of the proof to obtain the asymptotic distribution of A 
consists of three steps; see [34] , Chapter 3. Step 1 establishes the consistency 
of A to A . Step 2 establishes the rate of convergence of A to A , and 
Step 3 shows that suitably rescaled versions of a(\ q ) converge weakly to a 
limit, denoted £(v) in Theorem 3.2, in the space B[— M, M] for each finite 
< M < 00. Note that convergence in B[— M, M] for each finite < M < 00 
is to be meant convergence in B(— 00,00). See Pollard [28]. From here, the 
continuous mapping theorem will conclude that A , after normalization, will 
converge in distribution to the argmax^ £(t>). 
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The next theorem gives the consistency and rate of convergence of A to 
A , that is, Steps 1 and 2. Theorem 3.2 justifies Step 3, whereas Corollary 3.3 
examines the asymptotic distribution of A . 

Theorem 3.1. Assuming Conditions C.1-C.5, |A°-A°| = O p (k 1 / 2 n~ 1 ). 

We see that the rate of convergence of A to A is slower than the para- 
metric rate n _1 obtained by Giraitis, Hidalgo and Robinson [12]. This ap- 
pears to be reasonable due to the local behavior of our statistics. The same 
phenomenon occurs in other related, although different, problems involving 
nonparametric statistics; see, for instance, [10, 26] or [27]. 

Under Conditions C.2 and C.4, for r = 4, (A - A ) = O p (n 5 " 2 / 3 ) for any 
arbitrarily small 5 > 0. However, a closer examination of these conditions 
and the proof of Lemma 6.3 indicate that the rate depends on the number 
of finite moments of the sequence Et in Condition C.2. In general, with r > 4, 
(A - A ) = Op(n 5 -( 2r2 - 3T + 4 )/ 2 ( T2+2 )). So, the greater the number of finite 
moments allowed for et, the faster the rate of convergence of A to A . In 
the extreme case where all the moments exist, the rate of convergence of A 
becomes n 5_1 . This rate was obtained by Yajima [36] in the Gaussian case 
and is arbitrarily close to n _1 obtained in [12]. 

So Theorem 3.1 indicates that A = A + n~ l (2-n[k l / 2 v}) for some < 
M < oo. To examine the asymptotic distribution of A , let us introduce the 
notation 

(3.1) i n (v) = k(a(X s+[k i/2 v] ) - a(\ s )). 

in(v ) is a random step function which is constant in the intervals [i/k 1 / 2 , (i + 
1)A 1/2 ), \i\ <M, so that £ is a random element in the Skorohod space 
B[-M,M] for arbitrary < M < oo. 

We now establish our main result, that is, the aforementioned Step 3. 

Theorem 3.2. Let \v\ < M for any arbitrary M 6 (0,oo). Assuming 
Conditions C.1-C.5, 

^ n ( v ) w => y £( v ) in the space B[-M, M], 
where £(t>) is a continuous Gaussian process such that 

E(t{v)) = -h-^"v 2 a/2 and Cov(£(t;i), £M) = h-\ Vl v 2 , 

where ? = Jq 1 tp'(u) 2 du < oo with ijj'(x) = -^^(x). 
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The immediate consequence of Theorem 3.2 is that argmax^ is a nor- 
mal random variable. Indeed, because Theorem 3.2 implies that the limiting 
process £(v) is Gaussian, it can be written as 

av) = -h~ 1 i>"v 2 a /2 + h-\ 1 / 2 vX, 

where X = N(0, 1). But £(v) is a random parabola with fixed second deriva- 
tives and a unique maximum at 

v * = (ft> a )-\V 2 X, 

since by Condition C.5, < h^,0 < tp", so that d 2 ^(v)/dv 2 = -h^ft'ce < 0. 
From here we can observe the (possible) consequences of using a weight 
function i/j(u) which does not have a parabolic structure at u = 0. The main 
implication is that if the latter were the case, E(^(v)) would not necessarily 
be a parabola as in Theorem 3.2. For example, it may be that E(^(v)) = 
C\v\, in which case not only can the argmax„^(u) be difficult to obtain, 
but more importantly it would no longer be a normal random variable. So, 
in view of the asymptotic normality achieved with a weight ip(u) satisfying 
Condition C.5, it appears desirable to employ it. Similar issues occur when 
estimating the date of a break in a regression model; see, for example, [26]. 

Now we turn our attention to the asymptotic properties of A . Note that 
Theorem 3.1 indicates that 

3.2 A° = A S + v n = X° + v n + - 

n n \n 

where v n = argmax„ £ n (t>). Then we have the following: 

Corollary 3.3. Denote ^ = s(ijj"a)~ 2 . Assuming Conditions C.1-C.5, 
as n — > oo : 

(a) //A €(0,vr) ; then (2vrA: 1 / 2 )- 1 n(A - A ) ^ Z x o = Y = N(0, 

(b) // A = 0, then (2vrA; 1 / 2 )- 1 nA Z = Y1(Y > 0), where 1(A) de- 
notes the indicator function of the set A. 

(c) //A° = 7r, then (2vrA: 1 / 2 )- 1 n(A -7r)4z^ = yj(y <0). 

We now comment on the results of Corollary 3.3. First, we now see the ne- 
cessity of Theorem 3.1, as it will give us the normalization needed to achieve 
a "proper" asymptotic distribution. Next, we observe that the limiting dis- 
tribution of A depends on whether A is {0,7r} or A € (0,7r). The intuition 
about the limiting distribution of A in cases (b) and (c) is as follows. As 
the maximization of a(\ q ) in (2.4) is restricted to the interval < \ q < tt, 
for A = 0, it implies that (A — A ) = A > so that Zq cannot take negative 
values. Similarly, A = tt implies that A — tt < and Z w cannot take positive 
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values. So, the estimation of A falls into the category of a constrained opti- 
mization problem or inequality constraint estimation. Indeed, when A is an 
interior point of the set [0, n] and due to the consistency of A , we can expect 
that the constrained estimator, X° = X q , coincides with the unconstrained 
estimator X q = X q = (2-Kq)/n, where 

(3.3) q= argmax a(X q ), 

ge{0,±l,±2,...} 

whereas if A = 0, q = qT(q > 0). Similar arguments apply when A = it. 

Once we have examined the properties of A , we next examine the esti- 
mation of a. By Theorems 3.1 and 3.2 and the functional mapping theorem, 
it is easily shown that a(A°) — a(A°) = o p (k~ 1 / 2 ). So, A; 1 / 2 (a(A°) — a) and 
k^i&iX ) -a) have the same asymptotic distribution. However, the faster 
the convergence of A to A , the slower the rate of convergence of a(X ) 
to a, and hence it becomes slower than the rate obtained when A is known. 
The same phenomenon happens to hold in [26]. Hence, to circumvent this 
drawback, as in [26], we propose a two-step procedure to estimate a. To 
this end, we shall use as an estimator of a that given in (2.3) where X q is 
replaced by A = (2-Kq)/n such that |A° — A°| = O p (k x l 2 jn), and k and k\ 
are replaced by m and mi, respectively, satisfying: 

Condition C.6. mT 1 + mm± 2 + mfm~ 3 logmi + k/m — > with m = 
en*/ 5 , < c < oo. 

In addition, to be a bit more general regarding our choice of the weight 
function ip(u), we allow for the weighted function, say w(u), to satisfy: 

Condition C.7. Jq 1 w(u) du = 0, < h w = - w(u) (log u)du < oo, w(u) ~ 
cvy- as u — > 0+ for some 1/3 < £ < 1 and for all < u\ < u-i < 1, 

\w(u2) — w(u\)\ < D\u2 — iti| , < D < CO. 
So, our two-step estimator of a is defined as 
(3-4) d(A°) = -= E^Oog/^ + log/^), 

w p=l 

where /(A) = max{/(A), re -1 } and /(A) is as in (2.1) but with the smoothing 
parameter k\ there being replaced by mi, h w = — m _1 Y^™=i w p \og(p/m) and 
w p = w(p/m). 

We now comment on d(A°) compared to d(A°) = (/i ffl m) _1 SpLi w p log fq+ p - 
Observe that the former is a "symmetrized" version of the latter a(A°). As- 
sume for simplicity that A is known. As in other semiparametric estima- 
tors, for example, [31], one source of the bias of m 1 / 2 (a(A°) — a) comes from 
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the replacement of f(X) by g(X°)\X — A°| a , which in our case, that is, if 
A y£ {0, 7r}, will be proportional to 

m 

rrT 1 ! 2 E w P {\ + 0{\ 2 p )) = O^-V 3 / 2 ). 
p=i 

The main reason for this behavior is that when A = {0, 7r}, by symmetry we 
have g'(0) = </(tt) = 0, whereas for A ^ or ir, g'(X) may not be zero so that 
g- 1 (X )\X-X \ a f(X) = l + g~ 1 (X )g , (X )(X-X ) + O(\X-X°\ 2 ) by a Taylor 
expansion of g(X) around A . Recall the comments made on Condition C.l. 
However, when the estimator d(A°) in (3.4) is employed, the contribution of 
the above approximation (Taylor expansion) to the bias of m 1//2 (a(A°) — a) 
is proportional to 

m m 

™~ 1/2 E U! p(- X p + °( A ')) + m ' 1/2 E M X P + ( X D) = Oim^n- 2 ). 
p=i p=i 

Note that the latter holds true also for A = {0,7r}. So, the "symmetrized" 
estimator a(A°) would have a smaller bias order and thus it would have a 
faster rate of convergence to a than a(A°). 

Theorem 3.4. Denote $ 2 = 2" 1 w 2 (x) dx and B = (d 2 /dX 2 log g(X )) x 
Jq u 2 w(u) du. Let A be an estimator of A such that |A° - A°| = O p {k l / 2 /n). 
Assuming Conditions C.1-C.4 with k\ = n 71 and k = re 72 , where 2t/(t 2 + 
2) < 7i < 8/15, 37i/2 < 72 < minly 7 ^-, |}, r as in Condition C.2, and Con- 
ditions C.6 and C.7, then 

(a) (2m) 1 / 2 (a(A s ) - a) 4 N(4n 2 c 5 / 2 B/(2 1 / 2 h w ), $ 2 /h 2 w ), 

(b) (2m)V2(a(A0) - a ) A N(4tt 2 c 5 / 2 B /(2 1 / 2 h w ), <S> 2 /h 2 J. 

Remark 3.1. It is worth mentioning that the results of Theorem 3.4 
hold true if the weight ip(u) employed to estimate A is used in a(A°). 
However, this weight will not guarantee an asymptotic variance smaller than 
1, as is the case with the weight used in the Monte Carlo experiment. In fact, 
for the weight used in the Monte Carlo experiment, /i~ 2 $ 2 ~ 0.70, which is 
smaller than the corresponding asymptotic variance of other estimators of 
a suggested in the literature. Finally, the theorem indicates that although 
any preliminary estimator of A which satisfies |A° — A°| = O p (k 1 ^ 2 /n) is 
adequate for the results to follow, in practice it appears that one may use 
that given in (2.4) for computational simplicity. 

Theorem 3.4 provides a consistent estimator of the asymptotic variance 
of A , that is, \I/ in Corollary 3.3, by replacing a by a(X°). But more impor- 
tantly, it indicates that the two-step estimator a(A°), apart from having the 
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same asymptotic distribution as d(A°), achieves the optimal semiparametric 
rate of convergence obtained by Giraitis, Robinson and Samarov [14] when 
A = 0. So, asymptotically, there is no loss by using A instead of A . How- 
ever, to achieve the latter, as in other nonparametric estimates, d(A°) will 
have a bias term of the same order of magnitude as the standard deviation. 

4. Finite sample behavior. In this section we study via Monte Carlo 
analysis the finite sample performance of the estimators A and d(A°). The 
models employed throughout the simulations are 

(4.1) {1-L) a ' 2 x t = e t , t = 0, ±1 

(4.2) (l-2cos(vr/2)L + L 2 ) a/2 x i = e t , i = 0,±l, 

where {et} is a zero-mean sequence of i.i.d. Gaussian random variables. 
Model (4.1) generates a pole at A = 0, whereas model (4.2) does so at 
A = 7r/2. We have chosen a = 0.2, 0.4, 0.6 and 0.8. The autocorrelation func- 
tions of (4.1) and (4.2) are given by 

Pi= j -a/2 V- 1 ' J = 1 ' 2 "--> 

and 

1-3 -a/2 n ■ ! „ 

P2j= j_ a / 2 ^(i-i)' P2j-i = 0, j = 1,2,..., 

respectively; see, for example, [3]. For each combination of a and A , 2500 repli- 
cations of series of lengths n = 256 and 1024 were generated by the method 
of Davies and Harte [8]. 

Also, we have compared the performance of A and ct(A°) with the cor- 
responding estimators obtained using the log-periodogram estimator of [31] 
popular among practitioners. That is, consider A = Xg = (2tt q)/n, where 

(4.3) q= argmax d L oG(A g ), 

9=0,..., [ra/2] 

/ k \ _1 k 

(4.4) d L OG(A g ) = - 2^^-logi ^^(log/j+g + logVj), 

V j=i I j=i 

with (fij = logj — k~ l X^=i log^. Moreover, we have examined the behavior 
of the estimator dLOG(A°) of a, where k = m in (4.4). For the estimation 
of A , the chosen bandwidth parameters were, for n = 256 and 1024, k = 
14 and 24, respectively, and k\ = A;°' 6 loglog2A;, whereas for the two-step 
estimators d(A°) and dLOG(A°) of a, we have chosen m = n/4 and mi 



m 



0.6 



log log 2m. The weight functions used were ip(u) = —u 2 + 35n 2,5 /6 



29ti 3 /6 + 2u 3 logn and w(u) = u 1 / 3 — 9w 1 / 2 /8, respectively. 
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Table 1 illustrates the bias and standard deviation of the estimators A 
given m (2.4) and A in (4.3). More specifically, since A = (2nq)/n and 
A° = (2nq)/n, we have reported the bias and standard deviation of q and q. 
Table 2 summarizes the bias, standard deviation and mean square error 
of d(A°) and d(A°). The motivation to include d(A°) is to investigate the 
relative loss we incur by lack of knowledge of A in small samples. Recall that 
Theorem 3.4 indicates that asymptotically there is no loss. Moreover, Table 2 
illustrates the finite sample performance of the corresponding estimators of 
a obtained using the log-periodogram estimator in (4.4), that is, dLOG(A°) 
and aLOG(A°). 

Inspection of Table 1 indicates better performance of A than A across 
different models and sample sizes, especially for a > 0.2. For example, when 
a = 0.8, the finite sample performance of A is clearly superior to that of 
A , this superiority being greater with the sample size. With regard to the 
estimators of the memory parameter a, we observe that the proposed two- 
step estimator a(A ) outperforms cklog(A ) and has better finite sample 
properties for all a and A . In some cases, the performance of «log(A°) 
is very poor compared to that of d(A°), especially for large values of a. 
Finally, when comparing their performances with the estimators obtained 
when the location of the pole A is known, we observe that the relative 
loss of efficiency of &(A ) is smaller than that of ctLOG^ )• Moreover, as 
Theorem 3.4 indicates, it appears that knowledge of A is not relevant to 
estimate a when d(A°) is used, although it seems not to be the case when the 
log-periodogram is employed. Altogether, we can conclude that A and d(A°) 
enjoy better finite sample properties than the corresponding ones based on 
A° and d L OG(A°). 

Table 1 

Bias and standard deviation of q and q 



a 





0.2 


0.4 


0.6 


0. 


8 


A = n 256 


9.35 


(8.33) 


6.38 


(6.96) 


4.24 


(5.39) 


2.80 


(4.04) 




9.26 


(7.88) 


7.32 


(6.85) 


5.94 


(6.01) 


4.85 


(5.25) 


1024 


15.40 


(15.50) 


8.43 


(10.74) 


4.81 


(7.64) 


2.62 


(5.76) 




22.91 


(25.31) 


15.55 


(20.89) 


9.60 


(14.02) 


6.73 


(9.96) 


A = § n 256 


0.003 


(7.64) 


-0.084 


(5.33) 


-0.091 


(2.96) 


-0.054 


(1.56) 




0.209 


(9.59) 


0.270 


(9.21) 


0.272 


(8.66) 


0.320 


(7.28) 


1024 


0.051 


(11.87) 


0.117 


(4.77) 


0.063 


(1.89) 


0.216 


(1.13) 




0.435 


(27.89) 


0.144 


(25.77) 


-0.213 


(21.30) 


-0.060 


(13.52) 



The first row in each cell corresponds to q, whereas the second row is that of q. 



Table 2 

Bias, standard deviation and MSE of the long-memory parameter estimators 





n 






0.2 






n i 

U.4 








u.o 






U.o 




BIAS 


S.D. 


M.S.E. 


BIAS 


S.D. 


M.S.E. 


BIAS 


S.D. 


M.S.E. 


BIAS 


S.D. 


M.S.E. 





256 


-0.020 





.064 


0.004 


-0.022 


0.067 





,005 


-0.017 


0.071 


0.005 


-0.006 


0.072 


0.005 






-0.019 





.057 


0.004 


-0.030 


0.065 





,005 


-0.024 


0.074 


0.006 


-0.006 


0.075 


0.006 






-0.001 





.089 


0.008 


-0.003 


0.089 





,008 


-0.003 


0.089 


0.008 


-0.007 


0.082 


0.007 






-0.015 





.084 


0.007 


-0.043 


0.090 





,010 


-0.064 


0.099 


0.014 


-0.079 


0.105 


0.017 




1024 


-0.006 





.024 


0.001 


-0.003 


0.025 





,001 


0.007 


0.031 


0.001 


0.026 


0.031 


0.002 






-0.015 





.030 


0.001 


-0.014 


0.035 





001 


0.002 


0.040 


0.002 


0.032 


0.045 


0.003 






-0.002 





.042 


0.002 


-0.003 


0.042 





,002 


-0.005 


0.042 


0.002 


-0.005 


0.042 


0.002 






-0.022 





.045 


0.003 


-0.039 


0.054 





,004 


-0.046 


0.059 


0.006 


-0.051 


0.066 


0.007 


7T 

2 


256 


-0.020 





.055 


0.003 


-0.035 


0.059 





,005 


-0.041 


0.064 


0.006 


-0.040 


0.070 


0.006 






-0.010 





.046 


0.002 


-0.020 


0.053 





003 


-0.004 


0.062 


0.004 


0.043 


0.059 


0.005 






0.002 





.094 


0.009 


0.000 


0.094 





,009 


0.000 


0.093 


0.009 


0.005 


0.084 


0.007 






-0.050 





.098 


0.012 


-0.083 


0.121 





,022 


-0.100 


0.156 


0.034 


-0.083 


0.182 


0.040 




1024 


-0.012 





.022 


0.001 


-0.015 


0.024 





,001 


-0.007 


0.028 


0.001 


0.014 


0.034 


0.001 






-0.014 





.018 


0.001 


-0.017 


0.020 





,001 


0.003 


0.024 


0.001 


0.044 


0.035 


0.003 






-0.002 





.038 


0.001 


-0.004 


0.038 





,001 


-0.006 


0.038 


0.001 


-0.007 


0.038 


0.001 






-0.039 





.046 


0.004 


-0.064 


0.069 





,009 


-0.061 


0.096 


0.013 


-0.023 


0.097 


0.010 



The first row in each cell corresponds to the estimator d(A°), whereas the second, third and fourth correspond to the estimators a(\°), 
olog(A°) and aLOG(A°), respectively. 



e-i 

K 

B 

> 

r 

O 
O 
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5. Auxiliary results and proofs. We begin with the proof of Theorem 3.2. 



5.1. Proof of Theorem 3.2. Let t = -[vk l / 2 \. We examine the case t > 0; 
that for t < is similarly handled. First, since Condition C.5 and Lemma 6.10 
imply that — h^\ = 0(k^ 1 ), we have by the definition of a(X q ) that 



Lm l/2 ) = 2- 1 v( E&°(*) J c 1 + 

after observing that Cnit/k 1 ^ 2 ) =£, n ( v ), and where 



k 



^ 1) (*) = -«E^ 1 °g(b-*l+/p). 
P =i 



k 

?(2) 



4 2 )(t) = -a^^log((p + t)/p), 
p=i 

p=l ^ Jp+sAp ' 

p=l ^ Js-pAp ' 

p=l ^ Jp+sJp+s ' 



where f t = (2k x + 1) 1 E^i-j^ /(j+£)x(j+^ s )+(s+i)X(j+£= s ) and |qr| + = max(|g|, 
We examine the behavior of £n(t), for i = 1,. . . ,6, in four propositions. 
Specifically, Propositions 5.1 and 5.2 deal with the limiting bias of J2i=i £n\t), 
although for the proof of Proposition 5.1 we will allow t < pk for < p < 
1. Proposition 5.3 examines the finite-dimensional limiting distribution of 
£n\t) +£n(t) and Proposition 5.4 its tightness. Propositions 5.1-5.4 imply 
Theorem 3.2. 



Proposition 5.1. $\t) + $\t) = -&'a% + 0$ + £&). 
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PROOF. We only examine £n\t), £n(t) being identically handled. As- 
sume p < 1/2 first, so that < t < k/2. Then 

t k 

(5.1) i^(t) = -aJ2^og(\p-t\ + /p)-a ^og(( P -t)/p), 

p=i p=t+i 

where the first term on the right-hand side is 0(k~ 2 t 3 ) because \tp p \ < 
Dp 2 jk 2 by Condition C.5 and the integrability of |u 2 log((l — u)/u)\. Next, 
the second term on the right-hand side of (5.1) is 

2t 

-a V>plog((p-*)A) 
P=t+i 

k~t k 

- a (i/} p+t -ip p )log(p/k) +a ^ ij) p \og(jp/k). 

p=t+l p=k-t+l 

Proceeding as with the first term on the right-hand side of (5.1), the first 
term of the last displayed expression is 0(k~ 2 t 3 log(k/t)), whereas the last 
term is 0(k~ 2 t s ) by Taylor expansion of if)(u) around u = 1, noting that 
Condition C.5 implies that V(l) = and that J2p=k-t+i I l°g(f?A)l = 0(t 2 /k) 
by Taylor expansion of log(x) around x = 1. Finally, the second term of the 
last displayed expression is 

(5.2) -a- ^ ^(P/ k )-2lk £ < lo « + °U 

p=t+i p=t+i ^ " 

by integrability of |logu| and the fact that ip"(u) is Lipschitz continuous of 
order 1/2 by Condition C.5. By Lemma 6.10 and Condition C.5, the first 
term of (5.2) is 

-atj\\u){\ogu)du + o(^+o(^-(^+ £ }l^>g(pA)l)) 
= at f ^(uju' 1 du + 0(k~H(l + t 2 k~ l log(k/t))), 



noting that Condition C.5 implies that ijj(u) (log u)\q = and \ip' p \ < Dp/k 
and then proceeding as above. On the other hand, the second term of (5.2) 
is 

-|^V» W(log „) <i „ + (^„ g (^)=- a §! + (^o g ^ 
so that, because \k~ l / 2 t 1 / 2 \og(k/t)\ < D for < t < k, we conclude that 
II 1 ) (t) = at £ ^{u)u- 1 du-J^ + O^ + ^y 
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Now, when 1/2 < p < 1, so that k/2 <t <k, the proof is identical since in this 
case the left-hand side of (5.1) is —aJ2 P =i' l Pp^og(\t—p\ + /k) — aY.pZii^p+t — 
ij)p)\og{jp / k) + aJ2p=k-t+i ^pl°§(p/fc)' Then proceed as above. Proceeding 

similarly, £n\t) = —at Jq <^ n ~~ a ^W~ ^(s + PTz) - From here the 

conclusion is obvious. □ 

Proposition 5.2. |i 3) (t) + £ { n\t) = o(l). 

Proof. We only examine £n\t), as £ri\t) is similar. By definition 

i { n\t) is 

2fci fc fe 

(5.3) -J2i) p a p - ^pO-p-^Z^pdp, 

p=l p=2fci+l p=l 



where 



l°g(/(p+ s )I(p^ )+(s+l)X(p=0)^P+ s ) 
- lo g(/(p+ s _t)J(p^t)+( s +l)J(p=t)/p+s-t) 



and 



5p = l°g(Ap /(p+s)X(p^O)+(s+l)X(p=0)) 

- log(Aj^_ t | + f(p+ s -t)X(pft)+( s +l)X(p=t) ) • 

Since by Condition C.l and |A° — A s | < D'^h/p) < \%J P+S < D{k x jp) c 
it implies that, for p < 2k±, \a p \ = Oi\og{k\/p)) by Lemma 6.1. Note that 
at = 0(log k%). Hence the absolute value of the first term of (5.3) is bounded 
by 



£>|&[log(fci) + Z> £ |^[log(^)=o(j|f)= (l) 



by Condition C.4 and because Condition C.5 implies that \ip p \ < D(p/k) 2 . 
The absolute value of the second term of (5.3) is bounded by 

fciptog^sfci] k 
D Y \ip P a p \+D ^ l^pOpI 

p=2fci+l p=fc 1 [log 1 / 3 fci]+l 

= {^ + A 1+log {i))) =o{1) > 

where for the first term on the left-hand side we have used the fact that by 
Lemma 6.1(a), D^ 1 < \ f~+ s f p+s \ < D and then Condition C.5, and for the 
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second term on the left-hand side the fact that by Lemma 6.1(a), \fp+ s fp+s — 
1| = 0(p~ 2 k 2 ), which implies that | log(f p + s f p+s )\ = 0(p~ 2 k 2 ) by the mean 
value theorem, and then Lemma 6.4 with v p = k^ 2 \og(f~+ s f p+s ) there. 

To complete the proof, it remains to show that the third term of (5.3) is 
o(l). By Condition C.l, Ylp=i 4>p9p 1S 

k 

^(^^(Vm)) - log(g(A p+s _ t ))) 

P =i 

-aJ2^og(\X p+s - A ^; 1 ) - log(|A p _ t+s - A°| + A b i t|+ )). 
P =i 

Denote the first and second derivatives of log(g(A)) by /i(A) and h'(X), re- 
spectively. The first term of (5.4) is 

(27r£ \ ^ / *27rt \ 2 \ 

p=i p=i 
Am 2 t k 

n P =i 

where 9(p) € (0, 1), by Condition C.4 and because Lemma 6.10 and Condi- 
tion C.5 imply that k^ 1 J2p=i V'p = 0(k~ 1 ), so that 

k k 

]T ^ p h{\ p+s ) = ]T MHK+s) - h(\ s )) + o{\) 
p=i p=i 

= —h'(X s ) V p^ p + 0(k 3 n~ 2 + 1). 
p=i 

Next, the second term of (5.4) is 



-a- 



' 2i k \ 

J2+ E k(iog((A p+a -A°)v 1 )-iog(IVw-«-A°|+Aj;L t | + )). 

.p=l p =2t+l) 



The contribution due to J2 P =i i s by Condition C.5 and then Condi- 
tion C.4, whereas the contribution due to J2 P =2t+i^ by Taylor expansion of 
log(x), is 



."A, -A 

-an 



E i-J--—) 
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a,-a°\ 2 * r i i 



+ 2 n VV ) J2 +i ^\ p 2 (s) p 2 [s _ t) 

= Oitk" 1 log k) + Oik" 1 + tk~ 2 log k) = o(l), 

where p(s — a) is an intermediate point between p — a and (p — a) + n(A s — 
A°)/(27r), and then because n|A° — A s | < 7r, \p~^ p \ < Dk~i for j = 1,2, by- 
Condition C.5 and the fact that \p 2 p~ 2 (s)\ + |(p — t) 2 p~ 2 (s — i)| < D. So, we 
conclude that 

&\t) = -^ti(\ s )j2p^ P + o(i). 

p=l 

Similarly, £n\t) = 47r 2 th'(\ s ) J2p=iP^Pp/ n2 + From here the conclusion 
of the proposition is obvious. □ 

Proposition 5.3. The finite- dimensional distributions of£n (t)+£n \t) 
converge to those of a normal random variable. 

Proof. By the Wold device, it suffices to show that, for any finite I > 0, 

i=l \ i,j=l / 

where (pi satisfies J2i=i l^l 2 = 1 an d fi = limij/A; 1 / 2 . Denoting f^ l fe by g£, 

2t k 

"o+s-t) 



in ) i t ) = ~Yj l °g(9p+s/g P +s-t) ~ V»p log(5p+s/5 P H 



p=l p=2t+l 

(5.5) 

:= bit + ht- 

We begin by showing that bit = o p (l). In particular, we will show some- 
thing stronger than needed, that is, that for each e > there exists no such 
that 



(5.6) 



Pr( sup \h q - b Ul \>e\< DM^k^^e^ 1 log" 1 ki 
for all n > n and < t\ < t 2 < [/c 1/2 M]. Because t 2 < [k 1/2 M], 

2[fc 1 /2 A /] 

sup \bi q -b Ul \<2 sup |log5 9+s | \ip p \ 
h<q<t2 0<|?|<2[feV2M] p =i 

<D sup llog^+slAf 3 ^- 1 / 2 , 

0<\g\<2[k 1 / 2 M] 
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since Condition C.5 implies that J2p=i IV'pl < Da jk . Then by Markov's 
inequality the left-hand side of (5.6) is bounded by 

DM 3 k~ 1/2 e~ 1 E sup | \ogg q+s \ < DM 3 k~ 1 ^ 2 e~ 1 log^ 1 k\, 

0<\q\<2[k 1 / 2 M] 

because Lemma 6.1 and the definition of f q + s imply that g q + s is bounded 
from below by D~ 1 n~ h , for some h>l, and for x > D~ 1 n~ h , 



(5.7) 



log x — (x — 1) I < D(x — l) 2 log n, 



by Condition C.4, | logra/log/ci| < D, and the fact that by Lemma 6.3 and 
Condition C.4, for some j3 > 0, 



(5.8) 
(5.9) 



E[ sup Isg+s-ll'* <— - 



q:\q\K2kx 



D 



E[ sup |5g +s -ir )<Tmi- 

\q:2ki<\q\ J fcf 



log i+ ^ fci 
D 



So (5.6) holds, which implies that b\t = o p (l). 

Next we look at the second term on the right-hand side of (5.5), that 
is, &2t- Denoting 

k-t 

(5-10) b 3t = Wp+t ~ ^p)log(5 P +s), 

p=2t+l 

we have that 624 — &3t is 

2t fc 

(5.11) ip p+t log(g p+s ) - Y ^P l °s(9 P +s)- 

p=t+l p=k-t+l 

Because by Condition C.5, {£ 2t =t+1 \^ P +t\ + EjLg+i hVfJ} < Dk- 2 {t 3 + 
<? 3 ) < DM 3 ^ 1 / 2 , the first term of (5.11) satisfies 



Pr< sup 

I ti<q<h 



(5.12) 



2ti 2g 

Y ^P+ti l °S(9p+s) ~ Y ^P+q l °S(9p+s 
p=t\+l p=q+l 



> e 



< 



DM 3 



ek 1 / 2 log k\ 

by Markov's inequality and (5.7)— (5.8), whereas the second term satisfies 

k k \ 



Pr< sup 

I h<q<t 2 



J2 - J2 )^piog(5p+s) 

>p=fc — ti + 1 p=k — q+1/ 



>e < 



DM /t 2 — t 



elogfcA fc 1 / 2 
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by (5.7) and (5.9) and because EjU-g+i - T,p=k-ti+i = T, k p =L q +i and W 
Condition C.5, sup tl < g < t2 Ej=t g+ i hM < EjdU W < £>& _1 (*l " 4) < 
DMk~ l / 2 (t2 — h). 

So, (5.6), (5.12) and the last inequality imply that for any e > there 
exists no such that for all n > no , 

Pr{ sup |(I4 5) (Q) - &3 9 ) - (el 5) (*i) - &3 tl )| > ^ 

Ui<g<t 2 

(5 - 13> ™ „„ , „ 

Clearly (5.13) implies that sup 0<i<2 [ A ,i/ 2M ] \in (t) — &3t| = o p (l). Proceeding 
similarly, we have that for any e > there exists no such that for all n > no , 

Pr{ sup \(^)( q )-b 4q )-(i^(t l )-h tl )\>s} 

Ui<q<t 2 J 

<5 ' 14) ™ , u 

where 

k 

(5.15) 6 4 <= X! (^p~* ~ <Pp) l °g(9s-p)- 

p=3t+l 

Next we examine b-^t and 644 . Denoting $1 = gg — 1 and writing 

fc-t _ k 

(5.16) 6 t = E (V'p+i - ippj&p+s, h= E (ipp-t ~ ip P )$s-p, 

p=2t+l p=3t+l 

Lemma 6.5 implies that b% t = bt + t/k l l 2 o p {l) and 644 = &t + t//c 1 / 2 o p (l), 
where the o p (l) term is uniformly in t < pk, for p < 1/3. 

So it remains to examine bt and 6^. By Taylor expansion of tp p , 
(5-17) b t = l g ^ + ^ E ^(f + 4W, 

where 5 = 5(t) G (0, 1). The first term on the right-hand side of (5.17) is 
, k-2k! , r 2fci fc-t ~\ 

I E + r E + E k<w 

p=2fci+l lp=2t+l p=fc-2fci+lJ 

By Lemma 6.2 and Conditions C.4 and C.5, the second term of the last dis- 
played expression is clearly tk\' 2 /kOp{l + k- l k* +l/2 Z{a> 1/2)) =t/k 1 l 2 o p (l), 
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where the o p (l) term does not depend on t < pk. On the other hand, writing 
7]j = f~+ s Ij+ s — 1, the first term is 



(5.18) 



, fc-2fci // fci 

k ^2^+1.^.^ 

p=2fei+l j=—ki 

I fc— 2fei / / fci / ^ \ 

ft p=2fc 1+ l //Cl + 1 j=-k! V J?H-» ' 



Since \f p + s f p + s - 1| = - / p + s /p+ s | = 0{k\jp 2 ) by Lemma 6.1, 

l/p+s/p+j+s ~~ 1| — -^^T^ Condition C.3, and by an obvious extension of 

Robinson [32], E\ J2jL-kx c jVp+j\ = 0{k^ ) for any \cj =c(j/ki)\ < D, we 
obtain that the first absolute moment of the second term of (5.18) is by 
Condition C.5 and then Condition C.4, tk- l l 2 0{k\ /2 k- 1 / 2 £j =1 \i// p \/p) = 

t/k l l 2 o{l), where the o(l) term does not depend on t < pk. 

On the other hand, after rearranging subindices, the first term of (5.18) 

is 

, fe-2fci / ^ 2fci \ 




— — E ^'j+ki 

1 ^ X 7=1 / 



1 



J-- 

k—ki 



2*1 + 1? ^" fcl 



After standard calculations and routine application of Robinson's [32] The- 
orem 2, the last two terms are t/kO p {k\ /2 ) = t/k^ 2 o p (l) by Conditions 
C.4 and C.5, whereas the first term of the last displayed expression is 

2fci 



, fe-2fei , fc-2fci / ^ 

t E + z E 



2fci + l|^ +j 



1 



We note that V^ 11 ) continuous by Condition C.5 implies that ip' p+ j/ip p — ► 1 
as /ci/p — ► 0, and hence the expression inside the parentheses converges to 
zero as p^ oo. So by Toeplitz's lemma we conclude that the last displayed 
expression, and therefore also the first term of (5.17), is 



k X/2\ k l/2 

\ p=2fei+l 



E ^p+o P (i) 
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Proceeding similarly as with the first term on the right-hand side of (5.17), 
the second term of (5.17) is k'^H 2 ^- 1 / 2 Ejl^+i ^"(f + &{)V P + Op(l))> 
so that 

(5.19) fc= ( 53 ^ p + _o p(1 ) + 0p(1) , 

\ p=2fci+l / 

where the o p (l) and 0»(1) terms are uniform in t < pk. Similarly, we obtain 
that 

(k—2ki \ 
m £ ^' p V- P + {o p (l)+o p (l) , 
p=2fci+l / 

where the o p (l) and O p (l) terms are uniform in t < pk. 

Thus, (5.13), (5.14), Lemma 6.5, (5.19) and (5.20) imply that, for U < 
[Mk 1 / 2 ], 

£&(& B) fe)+& 8) fe)) = ^iW 1 ) 

i=l i=l 

(5.21) 

,i / < — , 

(f>i<j)jViVj 



\ i,7=l 



by Robinson's [32] Theorem 2 and Toeplitz's lemma, since \k 1 X^ = ' +1 (V4) 2 ~ 
?| = o(l) by Lemma 6.10. □ 

Proposition 5.4. |i 5) (t) and i£\t) are tight. 

PROOF. Write c t = £n (i) — &3t, where 634 is given in (5.10). To show that 
(i) is tight it suffices to show that q and b^t are tight. Since the finite- 
dimensional distributions of q converge to zero [cf. (5.13)], Billingsley's [4] 
Theorem 15.4 implies that c± is tight if for each e > and v > there exists 
a <5 G (0, 1) such that 

(5.22) Pr{0"(ct, 5) >e}<i/ 

holds for all n > no, where 

■&"{ct,8) =supmin{|Q - ctj, |q 2 - q|}, 

and the supremum is over t±,t and £2 satisfying ii < t < £2 with £2 — £1 < 
^[fc^M] and 5 G (0, 1). Observe that we can assume k 1 I 2 < [i^/Ar^M] — 
[ti/fc^M]. If [t 2 /fc 1/2 M] - [h/^^M] < AT 1 / 2 , then either t x and t lie in 
the same subinterval [{p— l)/M,p/M) or else t and £2 do; in either of these 
cases the left-hand side of (5.22) vanishes. 
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Inequalities (14.9) and (14.46) in [4] imply that (5.22) holds if 

Pr{0(ct, J) >e}<i/, 

for some < S < 1, where 

■&(ct,S)= sup \c t -c v \. 

\(t-v)/(k 1 / 2 M)\<8 

(Observe that as c± converges in probability to zero, which has continuous 
paths, the Skorohod metric can be replaced by the uniform topology.) By 
the corollary of Billingsley's [4] Theorem 8.3, it suffices to show that 



(5.23) 



J^Pr^ sup \c v - c t% _ 1 1 > e/3 > < y, 



where 2~ 1 8 < [k^M]' 1 ^ - U-i) < 6 and = to < ti < • • • < t r = [k l / 2 M\. 
But this is the case since by (5.13), 

t. f , 1 DM S S 
Pt< sup \c v - c t ^ 1 \> e/3 } < — — . 

Now choose no such that DM^e~ l log -1 k\ < v since r < 2[5~ 1 ] to obtain (5.23). 

Proceeding similarly, but using (5.14) instead of (5.13), & 6) (t) - b 4 t is also 
tight. 

Next we show the tightness condition for b^t the proof for ba is similar 
and is omitted. Consider t < q. Then b^t — bs q is 

k—t k—q 

(ipp+t-ip P ) log g p+s - ~ ^p) 1o S9p+s 

p=2t+l p=2q+l 

{2q k-t ~| 

E + E \(^ P +t-^p)logg p+s 
p=2t+l p=k-q+l) 

k—q 

+ E (^P+t-tPp+qjtogQp+s- 
p=2q+\ 

The first term on the right-hand side of (5.24) is tight, as we now show. 
Because by Condition C.5, 

2q k-t 



\ip p+t - ip p \ < Dk H, 



E x + E 1 

p=2t+l p=k-q+l 



<3(q-t), 



abbreviating the first term on the right-hand side of (5.24) by Ct,q, we obtain 
that 

^Pr( sup iCti-Lgl >g| <Y. F A D ~ SU P \ l °&9 P +s\>e\ 

~{ Ui_i<g<ti J I K i p=2t+l,...,fe J 
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e log k\ 

by Markov's inequality and (5.7)-(5.8) with \i = 1 there. Then choose no 
such that DMe~ 1 log~ 1 k\ < v to complete. 

Next, Taylor expansion implies that the second term on the right-hand 
side of (5.24) is 

k-q -i /a. > 2 k-q 



(5.25) 



k 



V "V if -I a 1/t 

2^ V P+i loggp +s + - 

p=2 g +l 



A: 



p=2q+l 



where £ is an intermediate point between t and q. 

The second term of (5.25) is tight as we now show. Proceeding as with 
the proof of tightness of q, it suffices to show that for all v and e > there 
exists no such that 

2 k ~Q 



E Pi 

i=i 



sup 

U-!<q<U 



q r 1 ) E \^p+e l °S9 P +s 

ft ' p=2g+l 

for all n > n and < t < ■ ■ ■ < t r < [k l / 2 M\. But by (5.7) and (5.8) and 
the fact that |^"(ii)| < D by Condition C.5, the left-hand side of the last 
displayed inequality is bounded by 

r r 

De^k^ 1 log" 1 fcij^j - ^-i) 2 ^ DM 2 e~ 1 log" 1 jfe! J2 52 < "> 

i=l i=X 

since r < 2[<5~ 1 ] and 2~ l 5 < [k 1 ' 2 M]~ l (ti - < 5. 

To finish the proof it remains to examine the first term of (5.25), denoted 
by dt,q- Since from the proof of Proposition 5.3, the finite-dimensional dis- 
tributions of dt t q converge to those of the limiting Gaussian process which 
has continuous paths, by Billingsley's [4] Theorem 15.4, it implies that it 
suffices to check that 

Pr{0"(dt, ff , 5) >e}<i/ 

for some n > uq. Now by Billingsley's [4] Theorem 15.6, it suffices to check 
the moment condition 

_ „ #2 



(5.26) 



k 1 / 2 



for t <q <v and some /3± > and /?2 > 1- Write 

k—q 



d 



t,q 



q-t 

k 



(5.27) 



E vv +i (ftH 

P=2<?+1 



1) 



+ 



q-t 



' 3fei k-q \ 

E + E Wp+ti 1 -9 P +s+ log < 

,p=2<?+l p=3k 1 +l) 
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Using (5.7), the first absolute moment of the second term on the right- 
hand side of (5.27) is bounded by 



D 



q-t 




log n| 


k 






p+t 



p+s 



1Y 



<D\q 



'k 2a 1 
t\logn[ -^-Z(a > 1/2) + — 



< D 



< D 



t-q 



fcV2 

t-q 



logn 



V ki 



+ 



ki 

fc 2a A .(l+Q/2 
P 



J(a> 1/2; 



for some < £ < 3(1 — a) by Lemma 6.2 and observing that J2p=2q+i 



< 



Dk\k~ l = o(k 1 / 2 ) by Conditions C.4 and C.5. So, the second term on the 
right-hand side of (5.27) satisfies (5.26), which follows by the Cauchy- 
Schwarz inequality, choosing fa = 1/2 and /?2 = 1 + £, and the fact that 



(q-t)(v 
in (5.19), 



< (v — t) . Finally, proceeding as with the proof of bt given 



E 



k 



p=2q+l 



^'p+t(9p 



<D 



q-t 

k l/2 



which implies that the first term on the right-hand side of (5.27) satis- 
fies (5.26) choosing (3± = 1 and fa = 2, and noting that {q — t){v — q) < 
(v — t) 2 . So, we conclude that 634 is tight. Proceeding similarly, 644 is also 
tight, which completes the proof. □ 



5.2. Proof of Corollary 3.3. (a) By Theorem 3.2, i n (v) w ^±/ £(^) i n 
D[— M, M] for any arbitrary M > 0. Next, since the limiting Gaussian pro- 
cess £(v) has continuous paths, that is, it belongs to C[— M, M], the Skoro- 
hod metric can be replaced by the uniform topology. On the other hand, by 
Eddy [10], the argmaximum is a continuous functional in the set of parabo- 
las with fixed second derivatives in C[—M,M\. So, by van der Vaart and 
Wellner's [34] Theorem 3.2.2, we obtain that 

v n = argmax£ n ,(f ) argmax£(w) = v* , 

V V 

where v* = ^ l l 2 X and X = N(0, 1). Observe that Theorem 3.1 shows that 
Pr{|t) n | < L} > 1 — 5 for n sufficiently large. This together with Problem 1.3.9 
in [34], page 27, implies that v n is uniformly tight. 
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But by construction, A = A + n~ l (2itk l / 2 )v n + Ofra" 1 ), that is, (3.2), 
and hence 

(27rA; 1 /2)-i n (A0 _ A o } = ^ + ± v * = ^i/2 X 

(b) As in (a), the limit process is £(v), where from the definition of A , 
v > 0. Thus, if X takes a positive value, the restriction v > is not binding 
and the maximum of £(v) is achieved at v* . However, when X takes a 
negative value, the restriction is binding and thus the maximum is at v = 
due to the parabolic structure of £(v). 

(c) The proof is identical to part (b) once the wording positive (negative) 
is replaced by negative (positive). □ 



5.3. Proof of Theorem 3.1. Similarly to the proof of Theorem 3.2, it 
suffices to show the theorem with replaced by . With that replacement 
and recalling that g p = f p 1 f p , a(X q ) becomes 

h~ l k 
P =i 



(5.28) 



h~ l k 

+ ^E^p log(Af p+g _ s|+ / p+(? ) 
P =i 

+ ^E^log(Af 9 _ p _ s|+ /,- P ) 
p=i 

h' 1 k 

~ a ~^T ^p( lo g( 27r IP + 1 - s \+/ n ) + l °g( 27r \q -p- s\+/n)). 

Because (5.7)-(5.9) imply that sup^ =0: ... t \ n /2] I ^°S9i\ = °p(1)> we then have 
that, uniformly in q, the first term of (5.28) converges to zero in probability. 
Next, consider the second term of (5.28). (Recall that < q < [n/2].) If 
q < max{0, s — [k\ log k±] — k] or s + [k\ log ki] < q, this is 



(5.29) 



g(\) \ 



because Lemma 6.10 and Condition C.5 imply that k~ 1 J2p=i' l Pp = 0(k~ l ) 
and by Condition C.l, f p = \X P — \°\~ a g(X p ). But by Lemma 6.1(a) and 
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Taylor expansion of log(z) around z = 1, (5.29) is bounded in absolute value 

The contribution due to the second term inside the brackets is easily shown 
to be o(l), as is the contribution due to the third term by Taylor expansion 
and the fact that n\X s — A | < it. Finally, by the mean value theorem, the 
first term inside the brackets is bounded by 

k i n \n J 

p=i 

because by Condition C.l, g(X) is continuously differentiable. Next, when 
s — [ki log k\] < q < s + [ki log fei], the second term of (5.28) is also o(l), since 
there are at most 0([k\ logfci]) terms such that \p + q — s| < [k\ \ogk\\, and 
hence by Lemma 6.1, D^ 1 log k\ < log(/ p+(? {A^ +(? _ s | + + fp+ q }) < D, whereas 
for the remaining ones \p + q — s\ > [k\ log k\], so that proceeding as before, 
it will be o(l) by Condition C.4. Thus, we conclude that the second term 
of (5.28) in this region is 0(k~ 1 k\ log k\) + o(l) = o(l) by Condition C.4. 
Similarly, when max{0, s — [k\ log k\] — k} < q < s — [k\ log k\] we obtain that 
the second term of (5.28) is also o(l). Proceeding as with the proof of the 
second term of (5.28), it follows that the third term of (5.28) is o(l) uniformly 
in q. 

Using Jq 1 tp(x) dx = and Lemma 6.10, we conclude that 

sup \a(\ q ) - l6 +>n (\q) - i<5_ n (A ? )| = o p (l), 

0<q<[n/2] 

where S + , n (\ q ) = -ah^k' 1 J2p=i ipp^g(\p+q- s\ + /k) and 5- jn (X q ) = -ah^ l k~ l Y%=i Vv lo g(l<?- 
p-s\ + /k). 

We now examine the properties of S +in (Xq); those of 5- )Tl (X q ) are handled 
similarly. First, by Lemma 6.10, 




Now, for arbitrarily small p > 0, sup p -i fc< | g _ s | 5 +:n (X q ) < Dp since by Taylor 
expansion, 

sup \\og{\q - s\ + ) -\og(\±p + q - s\ + )\ <Dp. 

p~ 1 k<\q—s\ 

Next, by Proposition 5.1, sup\ q _ s \ <pk 5 +! n(X q ) — a < —Dp 2 , whereas since 
5 + , n (X q ) is a nonincreasing function in \q — s\, sup pfc< | 9 „ s | <p -i fc 5+ >n (X q ) — a < 
-Dp 2 . 
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Pr sup (A n (t)-A n (0))>0 ->0, 
V|t|> P fc / 

that is, A is a consistent estimator of A . Thus, to complete the proof of 
the theorem, it suffices to show that for any e > 0, there exists L > such 
that 



(5.30) 



Pr 



sup _(A n (i)-A n (0))>0) <e. 

pk>\t\>k x / 2 L 



By Theorem 3.2 (cf. Propositions 5.1-5.3), 



^(A n (t)-A n (0)) = -^^^ 



1 + 
1 



+ 1 (h t -b t )+- t (^(t)-b 3t ) 

By Proposition 5.3 [cf. (5.19) and (5.20)] and Lemma 6.9, 

fc — 1*1 



sup 

\t\<pk 



(bt+h 



1 



fcV2 



O r 



sup 

\t\<pk 
1 



1 



T72 ^'pi^sv+s - 1) + O p (l] 



kV 2 



p=\t\ 



n k i/2j> 



since by Condition C.5 tp'(u) is continuous, so that sup^^ 
{2irI £:P+s — 1)| = O p (l) by Lemma 6.7, and thence by Lemma 6.5, 



(5.31) sup 

|t|<p& 



-(bt+b t ) + -(6 3t - It) + ^(b 4t ~ l t ) 



o P (fe" 1/2 ). 



By Condition C.4, there exists a finite positive integer r such that k\ < 

k 1 / 2 < ki . Consider first the case 2k\ +r ^ < k. Then the left-hand side 
of (5.30) is bounded by 



k 



sup -(A n (t)-A n (0))>0 

L f _ 1 fe/fc^- 1)/3 >|t|>L £ fc/fcf * 



(5.32) 



+ Pr 



+ Pr 



sup 

L r fc/fe[ /3 >|i|>2fciZ 



(A n (t)-A n (0))>0 



sup 

2k 1 L>\t\>k 1 / 2 L t 



-(A n (i)-A n (0)) >0 
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where Lq = p, > for £ > 1 and L > 0. Since h^aijj" > and A n (t) 
A n (0) > 0, the third term of (5.32) is bounded by 

k\ k 



Pr 



O, 



+ 



+ 1 



>a|V' // | inf 

2fc 1 L>|t|>fc 1 /2X 



< e 



for L large enough, by Lemma 6.8(b) and (5.31), and because Condition C.5 
implies that k 2 k~ 3 / 2 = o(l), and Ar 1 / 2 ^ - * 9 = fcV2fcJ +r/9 /(A:fcJ 1+r)/3 ) = O(l). 
Next, the second term of (5.32) is bounded by 



Pr 



O r , 



k 



k\ 



(r+l)/3+l 



+ 



k\ log k\ 



+ 



= Op(l) 

>a\^"\ 



inf 



Lik/k[ >\t\>2Lhi 



< e 



for L large enough, by Lemma 6.8(a), (5.31) and the fact that k x l 2 = o(ki), 
k 1 ' 2 < k { { +1)13 and Condition C.4. 

Finally, consider the first term of (5.32), whose typical element is, pro- 
ceeding as before, bounded by 



Pr 



,10 

n 



k 



+ 



A i 



fclogfci fcV2 



= P (l) 
>a\$"\ 



inf 

L f _ifc/fc< £_1),3 >|t|>Lffc/fcf 



tkl 



I'll 



<£, 



because ^ < ^y^- 



o(fc -1 fcP +1 ) = o(l) by Condition C.4 and £ < r. 



Next consider the case fc<2fcf""Mn this case, let f be the biggest integer 

but k < /c} + ^ r+1 ^. The proof now proceeds 



such that 2k 



l+r/3 
1 



< k < k 



l+r/3 
1 



identically as in the previous case, but now the sum of the first term of (5.32) 
runs from i = 1 to f. Note that if k < 2/c| + ^, so that r = 1, then the left-hand 
side of (5.30) is bounded by 

k 



Pr 



sup 

pk>\t\>2Lki_ 



(A n (i)-A re (0))>0 



+ Pr 



sup 

2Lfci>|t|>fc 1 /2l 



(A n (t)-A n (0))>0 



and then proceed as in the proof of the second and third terms of (5.32), since 
Condition C.4 implies that k\ = o(k 2 ), and recalling that now k < 2A^ + ^, we 
have k\~ P k~ l l 2 = o(l) by Condition C.4. □ 
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5.4. Proof of Theorem 3.4. We begin with part (a). Observing that Con- 
dition C. 7 implies that \h w — h w | = 0(m _1 ) and m _1 J2p=i w p = 0{m~ l ) by 
Lemma 6.10, the behavior of (2m) 1 / 2 (a(A s ) — a) is governed by that of 

' l W V^... !__/ Jp+SJS-p 



w p log(g p+s g s - p ) + " w P log 



(5 ' 33) (2m)V2 a /i - / p x \ 

-H^{m^ plog U +K i 

Recall that f p ~ 1 fp = g P - By Lemma 6.10 the last term of (5.33) is o(l). 
Denoting m\ = [m\ log mi], the second term of (5.33) is 



(5.34) 



Next, because by Lemma 6.1 D 1 < X? ni fs±p < D for \p\ < 2m*, we have 
that the first term of (5.34) is 0{m~ 1 / 2 log(mi) E^i \ w v\) = 0(m} +C m-^ +1 )/ 2 x 
log 1+c (mi)) = o(l) by Conditions C.6 and C.7 since Q > 1/3. Denoting 
g(\p) = g p , the second term of (5.34) is 

t — 1 m 

E ^{log(A; 1 |A p + A s -A |)+log(A; 1 |A s -A°-A p |)}. 

^ ' p=rrij+l 

Because, for \p\ > 2m*, Lemma 6.1(a) implies that D" 1 < m^ 2 p 2 \f~+ s f p+s — 
1| < D, we obtain by the mean value theorem that log fp+ s f p + s = 0(m 2 p~ 2 ) 
and so the first term of the last displayed expression is bounded in absolute 
value by 

£^ £ \ Wp \p-> = 0{ m^m-^/% 

V / p=m*+l 

whereas the second term is 4ir 2 Bc 5 ^ 2 / (2 1 / 2 h w ) + 0(m _1//2 ) because 
^ m 
— -— £ ^ p log( 5p+s5s _ p ) 

^ ' p=m*+l 

2 m * 4ir 2 Bc 5 / 2 ( 1 
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by Condition C.6 and Taylor expansion of log(g p+s ) and log(<7 s _ p ) around 
log(<? s ) and that by Lemma 6.10 and Condition C.7, J2p"=i w p = 0(1). So, 
the second term of (5.33) is 4-7r 2 i?c 5//2 /(2 1/ ' 2 /i U) ) + o(l). Finally, proceeding as 
with the second term of (5.4), the third term is easily shown to be bounded 
byDm-^ 2 Ep\w P \p- 1 =o(l). 

Denoting r &b = 9b~ L and proceeding as with the proof of Lemma 6.5, to 
complete the proof of the theorem, it suffices to show that 

u—l m 

(5-35) ]T w p {d p+s + tf,_ p ) 4 jV(0, h~ 2 <S> 2 ), 

( 5 - 36 ) 7^m{>2+ 12 K«h. + ^-p)^0. 

We begin with (5.36). By Lemma 6.2(b), the first moment of the first 
sum inside the braces on the left-hand side of (5.36) is oinT 1 ! 2 J2p=i \ w p\) = 
o(m} + ^m~( 2< ' +1 )/ 2 ) = o(l) by Conditions C.7 and C.6 since £ > 1/3, whereas 
the contribution due to the second sum inside the braces on the left-hand 
side of (5.36) is O p (rrii m 1 ' 2 ) = o p (l) by Lemma 6.2(a), Condition C.6 and 
Markov's inequality. So, it remains to show (5.35), whose left-hand side is 

,_1 /2mi m \ 

(5.37) —^(J2M^P+s + ^- P )+ E + • 

{Zm) y p=l p=2mi+l / 

Because Lemma 6.2(b) implies that E\$b\ = o(l) for \b — s\ < 2mi, the first 
term of (5.37) is o p {m\ +C -m~^ +l ^ 2 ) = o p (l) by Conditions C.7 and C.6 
and Markov's inequality. But proceeding as in the proof of Proposition 5.3 
[cf. (5.21)], the second term of (5.37) converges in distribution to Af(0, h~ 2 Q 2 ), 
which completes the proof of part (a). 

Part (b). Dropping the constant h w and (2m)^ 1 / 2 , it suffices to show 
that 

m 

(5.38) E w p{ lo g(9p+s-t/g P +s) + log(g s - p - t / g s - p )} = o p (m 1/2 ) 
p=i 

holds uniformly in \t\ < \k 1 ' 2 M\ = o(m 1 ' 2 ). We only examine the contribu- 
tion due to the first term on the left-hand side of (5.38); the contribution due 
to the second term follows by identical steps. The first term on the left-hand 
side of (5.38) is 

2t m 

(5.39) Y w p l °s(9p+s-t/g P +s) + E w P l °g(9p+s-t/g P +s)- 

p=l p=2t+l 
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Using (5.7)— (5.9) and Markov's inequality, the first term of (5.39) is, uni- 
formly in t, o p (Y^p=i M ^ \ w p\) = o p (m 1 / 2 ) by Condition C.6. Next, the second 
term of (5.39) is 



2f 



m—t 



Y W P+tl°gg P +s ~ Y W P lo S9p+s + Y 
p=t+l p=m-t+l p=2t+l 

By (5.7)-(5.9), the first two terms of the last displayed expression, uniformly 
in t, are o p {k 1 / 2 ) = o p {m l l 2 ) by Condition C.6. 



Finally, we consider the third term in the last displayed expression. Let 
tip = 9p ~ 1- Since by Lemma 6.3 and Markov's inequality, sup p=1 > ... i [ n /2] Wp\ = 
o p (l), except in a set Q n such that linin^oo Pr{f2 n } = 0, it implies that 
logg p+s = $ p +s — 2 _1- #p_|_ s (i + Op(l)) by Taylor expansion. So, the third term 
of the last displayed expression is 

( m—t 2mi \ 



(5.40) 



Y + £ 

,p=2mi+l p=2t+l) 



W p )$ 



p) u p+s 



m—t 

+ D Y \ w p+ i 

p=2t+l 



Wp\€+ s {l + o P (l)). 



Since Condition C.7 implies that |u> p +t — w p \ < D(t/m)^ , we have that 



sup 

t<[k 1 / 2 M] 



m—t 



Y \w p+t -w p \$ 

p=2t+l 



2 

p+i 



<D 



sup — I 

"1 / 



t \ 



+ sup 

t<[k 1 / 2 M] 



2m\ 

Y w p®. 

p=2t+l 



2 

p+s 



Y 

(<[^M]Vmy Vp=2mi+1 

But by Lemma 6.2, E\ti p+S \ 2 = 0{m{ 1 ) if 
implies that sup p=1| _ r n / 2 i $p+s = o p (l). Hence, by Markov's inequality and 
Conditions C.6 and C.7, the third term of (5.40) is o p (£^/ 2 ?ti 1 ~ < >/ 2 ?ti^ 1 + 



> 2mi, whereas Lemma 6.3 



m s m 1 1 *J = o p (m ' ). Proceeding similarly and in view of Condition C.7, 
the second term of (5.40) is o p (m^ 2 (/c 1 / 2 /m)^) = o p (m 1 / 2 ). So, denoting b t = 
Z)^2mi+i ( w p+t ~ w p)^p+s, that is, the first term of (5.40), to complete the 



proof we need to show that sup t< r fc i 



[fcV2M] 



sup 



9=1, 



is Op(m 1 / 2 ). Now, by the triangle inequality, sup t 



,[Mfc!/4] SU P( 9 -l)feV*<t< g fcl/4 N 



<[fcV2M] 



\bt\ is bounded by 



sup sup 

9=1,..., [Mk 1 / 4 ] (q-l)k 1 / A <t<qk 1 / i 



m—t 



Y ^p+t - w p+gkl/4 )^ p+t 

p=2mi+l 



(5.41) + sup sup 

g=l,...,[Affc 1 /4] (g-l)fcl/4<t< g jfel/ 4 
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p=m—t+l ) 

Because (sup,- [c 7 -|)' x = sup,,- \cj\^ < J2j \ c j\^ f° r M > 0, the second moment of 
the second term of (5.41) is bounded by 



E U 

q=l I 



E (%+<?fc 1 /4 - W p )$ p+t 
p=2mi+l 



[MfcV2] 

+ E * 



E (%+gfc 1 / 4 - w p¥p+s 
p=m—£+l 



[Mfc 1 / 4 ] c [Affc 1 / 2 ] 

<D £ U+ E 

<j=l I £=1 



£> 2C _ i =0 (m), 



proceeding as with the proof of (5.16) and noting that Condition C.7 implies 
that \(m/p)^w p \ < D and |(m/(< ? A; 1/4 )) C {w p+qk i/4 — w p )\ < D with £ > 1/3. 
The second moment of the first term of (5.41) is bounded by 



[AffcV4] gfc 1 / 4 

E E ^ 

q=l 4= ( g _l)fcl/4 +1 



m—t 



J2 (wp+t - w p+qk i /i )'d p+t 

p=2mi+l 



E E 



m 



2C 



E 



9=1 f=(g-l)fc 1 / 4 + l 



m—t 



E 

p=2mi+l 



where |"U>* )t | = ((jri/^ 1 / 4 )^^^ — w p+9fc i/4)| < D by Condition C.7. Now pro- 
ceeding as with the proof of (5.16), the right-hand side of the last displayed 
equation is 0(m 1 ~ 2< > k^ l+ ^/ 2 ) = o(m) since £ > 1/3 and k = o(m) by Condi- 
tion C.6. Using Markov's inequality we conclude that (5.41) is o p {m 1 / 2 ) and 
the proof is complete. □ 

6. Technical lemmas. From now on J2j denotes Yuj=—k! anci kin" 1 — > 0. 



Lemma 6.1. Let f p be as defined in the proof of Theorem 3.2. Then 
D- 1 < f p l f p < D, 

(/-7p - 1) = o(fc?/b - *l 2 ), Ip-«I> 2fei. 

(b) D'^X^fpKD if\p-s\<2k 1 . 



(a) 
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PROOF. First observe that by definition of f p , f 1 f p = (2k\ + 1) 1 J2jf P 1 x 
f j+p . We begin with (a). We first show that D^ 1 < (2ki + l)" 1 J2j f^fj+P < 

D. Because |A° - A s | < £, D' 1 < |1 + affi+p-J) I < A for b'l < *i, so that 
Condition C.l implies that 



D 



k 



-1 [fcl/2] 

- E 

i=[fei/4] 



j +p 



< 



2A- 



— E 



But |p - s| > 2fci and |j| < fci imply that 2/3 < |(p - s)/(j + p - s)\ < 2. 
From here the conclusion is standard since a > 0; we conclude the first part 
of (a). Next, we show the second part of (a). By Taylor expansion of fj+ p , 
the left-hand side is 



2k 



—E 



(2vr)j /; (2vr) 2 j 2 /"(A) 



2n 2 f p 



<-E 



p — s + Sj 



(1 + o(l)), 



where A = A(p + 5j) is an intermediate point between X p and A p+ j and 
5 = 5(j) € (0,1), by Condition C.l and the fact that / p ~ 1 /(A) is bounded. 
The conclusion follows since \p — s + Sj\ > \p — s\ — \Sj\ > \p — s\/2. 

(b) It is immediate since by Condition C.l, /y+ p )zQ;+p^s) = D\^ p _ s ^(l + 
o(l)) and A^ Q A^ =o(k±). □ 



Lemma 6.2. Denote (p(ki) = 0{k 1 1/2 ) + 0{k^ l )l{a > l/2)J(|p - s\ < 
2k\). Then 



(a) 
(b) 



E\f- 1 f p -l\ = i P (k 1 ), 
E\f- 1 f p -l\ = i P (k 1 ). 



PROOF. We begin with (a). L 1 f p -1 is 



(6.1) 



f-\2k 1 + l)- 1 J2 fi- 

j+p^s 



Lj+p 



f 



3+P 



+ f-\2h + l)- 1 ^ - f s +i)X(\p -s\<h 



In view of Propositions A.l and A. 2 of [22] and Lemma 6.1, the first term 
of (6.1) is <p(ki), whereas the second term of (6.1) is also <p(ki) by Lemma 6.1(b) 
and E{n~ a I s ) < D. 

To show part (b), it suffices to examine f p ~ 1 (f p — f p ), which is by definition 



(6.2) 



/- 1 (n- 1 -/ p )X(/ p <n- 1 ) 

= a/" 1 *" 1 - 1) - (t l f p - l))l(f p < n- 1 ). 
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By the Cauchy-Schwarz inequality, the second moment of the right-hand 
side of (6.2) is bounded by 

2E(f- 1 f P ~ I) 2 + 2(/- 1 n~ 1 - l) 2 E(l(f p < n' 1 )) < DE(f- l f p - l) 2 , 

using the fact that E(X(f p < re -1 )) is 

Pvif-'fp - 1 < /-V 1 - 1} < PrUf-'fp - 1| > |1 - !p X n- l \), 

because by Lemma 6.1, f~ 1 n~ 1 — 1 < —D for n large enough. Now use 
part (a) and Markov's inequality to conclude. □ 

Lemma 6.3. Let 2k\ < v < u < [n/2] and p = 0, 1, . . . , [n/2] . Denoting 

^v,u)=O r a f-J^ /T ) and^k)=0{\o g '^k), 
k 1 



(a) 



(b) 



El sup \f-Hfp-f P )r)=<P(ki), 

\p:|p-s|<2fci / 

e( sup \f-\f p -f p )\)=^v,u), 

\p:2ki<\p—s\=v+l,...,u / 

E sup \fp 1 (f p -f P )\ ft = <p(k 1 ), 

P '■ \p—s\<2ki 

E sup If^ifp-fp)] =i>(v,u). 

p: 2fci<|p— s\=v+l,...,u 



Proof. For notational simplicity we shall take s = 0. We begin with 
part (a). From Hidalgo and Robinson's [22] Proposition A.l, it suffices to 
examine the behavior of f p {f p — Ef p ). On the other hand, Hidalgo and 
Robinson's [22] Proposition A. 3(a) and (b) implies that it suffices to examine 
the behavior of f p ~ 1 (f e ,p — Ef EtP ), where 

p = 2k +1 ? fj+p^,j+p 
1 j 

and I £iP = / e (A p ) is the periodogram of {et}" =1 . We examine sup p=i;+lj u \ f~ 
(fs, P ~ Ef £yP )\ only; that of sup p=lj ^ \fp l {fe, P ~ Ef £)P )\ is similarly han- 
dled. Because sup^ \aj \ = (sivpj \aj\ T ) 1 ^ T , the rth power of sup p=t;+1 u j/" 1 x 
(/ £)P — Ef £ ^ p )\ is, except for constants, 



sup 

p=v+l,...,u 



2ki + 1 fa+pAi^Iej+p ~ 1) 
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2 T 1 sup sup 

q p 



J 



(6.3) 



'j+b,p(( 27r )4,j+6 - 1)) 



+ 2 T 1 sup sup 

q p 



1 



2k 



where su P(? and sup p denote ^P g=1+v/k ^ y ^ u/k i/r and su Pp=1+6 _ fcl/T) & , 

respectively, and b = qk\^ T . 

After the change of indices j = j' — k±, the second term of (6.3) is bounded 

by 



D sup sup 

q p 



1 



2k 



—7 X] (fy+b-kup ~ <Aj+b+l-fci,p) 

1 ' a — n 



j=0 



(6.4) 



£((27r)/ £ia+6 _ fcl -i; 



a=0 



+ Z?supsupj0 b+fcli 



9 V 



Y 2fci 

2liTT§ ((2 ^ )/ej+fc " fcl " i; 



by Abel summation by parts. On the other hand, by Conditions C.l and C.3, 
i+b+i-k^pl^Df^ij + b-h)- 1 



Pj+b-fci.p — <Pj- 



n 



< D 



since by Lemma 6.1(a) D" 1 < \ fp 1 f p \ < D. So, using sup^ \dj\ T < J2j \ a j\ T ■> 
by Holder's inequality and D~ l < \<j>b+ki,p\ < D by Lemma 6.1(a), we obtain 
that the first moment of (6.4) is bounded by 



2k 



,,1/r 

D u / k l 2ki , 



V 



q=l+v/k^ /T 3 

x E 



+ b-ki 



(j + b- h 



3 

a=0 



J2((2n)I £>a+b - kl -l) 



38 J. HIDALGO 



u/k 1 



1/t 



d E E 

q=l+v/k\ /T 



2ki 

2fc7Tig ((27r)j£ ' i+6-fel " i: 



which, because (2ir)EI e j +p = 1 and proceeding as in the proof of Brillinger's [5] 
Theorem 7.4.4, is bounded by 



U / k l / / U \ TQ 1 2fc 



d T ff — ) 1 T {j + 1)T/2 i fc- r/a 



g=l+r;/fci 



//iy/2 / i \ r/2x /rnax(u-u,fci) 



because a < 1, 6 < 2(6 — g > 1 + t>//c^ T and 6 = qk\ . Thus, we conclude 

that the second term of (6.3) is 0(max(u — v, k\) /k[^ 2+1 ^ T ). 
Next, we examine the first term of (6.3). Because 

2fci 

a pM = ^(^j+p-kiAi^Iej+p-ki ~ 1) - </ , i+b-fci,p((27r)/ £ij+fo _ fcl - 1)) 

has at most k\^ T terms, and because (2tt)EI £ j +p = 1 and proceeding as in 
the proof of Brillinger's [5] Theorem 7.4.4, its rth moment is bounded by 

1/2 

h± , so that the expectation of the first term of (6.3) is bounded by 

u/k\ /T b 

D E E *T- T 

q= l +v /k\ /T p=l+b-k\ /T 

because r > 2. This completes the proof of part (a). 

To show part (b), denoting a p = J" 1 ?! -1 — 1 and using (6.2), it suffices to 
examine 

sup|a p |T(/ p < n" 1 ) < Dsup X(] p x f p - 1 < a p ). 
v v 

But the expectation of the right-hand side is bounded by 



u — v 



DEli sup \ f p 1 f p — lj > min \a p \ 
\ v v 

= £>Pr jsupl/^/p - 1| > mjn|a p || 

- D ( m p in \ a p\j E^sup Ifp^-fp - 1 
< D((p(ki)X(p < 2ki) + ip(v,u)l(p > 2ki)) 



SEMIPARAMETRIC ESTIMATION OF THE POLE 39 
by Markov's inequality and because (sup„ |c p |) r = sup p |c p | T . □ 

Lemma 6.4. Let h(u) be a twice continuously differentiable function in 
(0, 1) such that h(0) = h'(0) = h(l) = 0, where h'{u) = -£h(u). Consider a 
sequence {uj} such that \j 2 Vj\ < D for all j. Then, for a > It and denoting 
q = p-t + l, 

(6.5) £ h{j/v){v v ) = o(^\og(^)+— 2 +- 2 

\p 2 \tj pq 2 p 2 

PROOF. The left-hand side of (6.5) is 

p a 

E Ki/p>3- E H(j + t)/p>j 

j=p—t+l j=a—t+l 

+ E (h(j/p)-h((j + t)/p))uj. 

j=a+l 

Since the first derivative of h(u) is continuous and h(l) = 0, from the mean 
value theorem it follows that the absolute value of the first term of (6.6) is 
bounded by 

D E \(p-j)/p\\^\ = 0(t 2 /(p(p-t+l) 2 ))=0(p~ 1 q- 2 t 2 ), 
j=p-t+l 

using \j 2 Vj \ < D. The absolute value of the second term of (6.6) is bounded 

since h(0) = h'(0) = and \j 2 Vj\ < D, whereas the absolute value of the third 
term of (6.6) is bounded by 



p-t 



D- 

P j=a+l 



h'( J -)+-h"( J -+Z t 



P J P \P P 



by Taylor expansion of h'{x) and using /i'(0) = 0, where £ = £ (0, 1). 

□ 

Lemma 6.5. Let bt, b%, bzt and b^t be given in (5.16), (5.10) and (5.15), 
respectively. Then, for p < 1/3, 

(a) sup t < pfc t- 1 \b 3 t-b t \ = o p {k- 1 / 2 ), 
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(b) sup t < pk t 1 \b 4t -b t \ = o p (k 1 / 2 ). 

Proof. We only examine part (a); part (b) is identical. Because by 
Lemma 6.3, sup^ =1 ^„/ 2 ] \§£ ~ M = O p (\og~ 2 ki), then except in a Set ^lifi 
such that lim n Pr{O n } = 0, logg. +s = -&. +s - 2" 1 tf 2 +s (l + o p (l)) by Taylor 
expansion, which implies that for n sufficiently large, by definition of b^t, 



k-t 

t<pk t<pk p= 2t+l 



supt 1 \bst — h\<D sup t 1 typ - ipp+tltip 



Because by Lemma 6.2, for \p\ > 2ki, $ p+s = O p (k 1 1 ), for \p\ < 2ki, ^ p+s = 

O p (kl [a ' l) I(a > 1/2) + k{ 1 ) and by Condition C.5, \tft p - tft p+t \ <D\^\t/k, 
where ift' p = tp'(p/k) and p < £ < p + t, the last displayed expression is 

( l{a > 1/2) ffi sup f<2fcl |^| | 1 * sup t < pfc \^\ \_ 

\ ft i p=i 1 p=i / 

by Conditions C.5 and C.4. □ 

Lemma 6.6. Let (ftp = (ft(p/k), where (ft{u) is a continuous function in 
(0,1). Define 

2 M 
c r (/z;tf) = — — ^ ^ (ftpCos{rX p ), 

where < [i < $ < 1. For any fj, < $i < $2 < 1 ; if k/n—*0, then 

n—l n—ri $ 1 

( 6 - 7 ) E E c r2 (/x;^i)c r2 ( At ; 1 ? 2 )= / ^ 2 (n)^(l + o(l)). 

ri=lr 2 =l 

PROOF. The left-hand side of (6.7) is 

4 [fc#i] [fctfa] n-l n-n 

^27; E 0pi E E E cos(r 2 A Pl )cos(r 2 A P2 ) 

pi=[fc/*]+l p 2 = [M+l n=lr 2 =l 
^ [fei?i] n-l n— n 

= rfk E <^pE E cos 2 (r 2 A p ) 

p=[fc /J ]+l n=lr 2 =l 

(6.8) 

2 [fe^i] [fctfa] n-ln-n 

+ ^ E ^pi E ^p 2 E E { cos ( r 2 A Pi+p 2 ) 

pi = [fep]+l P2 = [M+ 1 .P2^Pi ri=lra=l 

+ cos(r 2 A Pl _ P2 )}. 
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Because (see [32]) YT r ~=x £r 2 =i cos 2 (r 2 A p ) = (n - 1)74 and 

for pi^p 2 , 



n—l n—ri 

E E { cos ( r 2A pi+P2 ) + cos(r 2 A pi _ P2 )} 

ri=l r2=l 



-n 



the right-hand side of (6.8) is 



(n- l) 2 / 1 



E ^p J ~ ^ E #w ^p 

p=[fcp] + l / pi=[fe/x]+l P2 = [k^l]+l,P2^Pl 



[iWa] 

E 



2 (n)dn(l + o(l)), 



because is continuous in n and k/n^ 0. □ 

Lemma 6.7. Denote r\ p = (2ir)I £tP — 1 and <^>(n) as in Lemma 6.6. T/ie 
process 

fc-[fc0] 



1 



fcV2 



£ <M P , < < 1/2, 



P=IM]+1 



is ifo/it. 



Proof. Since by Proposition 5.3, the finite limit distributions of Rn($) 
converge to those of a Gaussian process with continuous paths, then by 
Billingsley's [4] Theorem 15.6, it suffices to check the moment condition 

(6.9) E(\RM - RnWriRntf) - Rn(#i)\ T ) < D($ 2 - d x f 

for some r > 0, ip > 1, where < < ^ < < 1 /2. Because 



R n {$)-R n {d 2 ) 



1 [k& 3 ] 1 k-[h&] 

E ^f+Ii72 E <t> P ri P , 

p=[kd]+l p=fc-[fc# 2 ]+l 



a sufficient condition for (6.9) to hold is 



E 



1 



[M 2 ] 



fc l/2 



E ^p 



(6.10) 



P =[fci?]+i 



1 



fc l/2 



fcl/2 



E 



p=fc-[fci? 2 ]+i 



fc l/2 



E ^p 

p=[fci?i]+l 
fe-[JWi] 

E ^p^p 

p=k-[kff\+l 



We will examine the first inequality of (6.10) only; the second displayed 
inequality is similarly handled. 
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By definition of rj p , 



TTJ2 E <Mp=llfe E ^ — Z>r"l) 
K p=[JW]+l V p=[iW]+l / V ' L r=\ / 

n r— 1 

+ E £r E e a c r-a($, ^2) 
r=2 o=l 

:=^l,n(^,^2)+^,n(^^2), 

where c r (#, #2) was denned in Lemma 6.6. Because | X^SjU+i 4>v\ — Dk\d2 — 
tf\ by continuity of and E(Y,r=i( £ r ~ I)) 2 < Dn by Condition C.2, 
£(|£i )n (tf,t? 2 )||£i,n(tfi,#)|) < ($2 - #i) 2 by the Cauchy-Schwarz inequality 
and |0a - #P - Vi| < |0 2 - i9ij 2 - That is, £i,„(0,02) satisfies the first in- 
equality in (6.10) with r = 1 and ip = 2. So, to complete the proof, it suffices 
to examine that the first inequality in (6.10) holds for £2, n {$, $2)- The fourth 
moment of £2,71 ($> $2) is 

4 e iWi>%^-%^) 

. 2=n<r2<r , 3<r4 j=l \Oj=l / 
4 / \V2 

<^n e <*,-., (<w) 

j=l \l<aj<rj<n / 

= D( Yl C r-a(<w) , 
\l<a<r<n / 

proceeding as in the proof of Lemma 5.4 of [12]. But proceeding as in 
Lemma 6.6, the right-hand side of the last displayed equation is bounded by 

J <j?{u)du\ <D($2-$if 



since <f>(u) is continuous. This concludes the proof, choosing r = ip = 2. □ 



Lemma 6.8. Let 2k\ <t$ < pk for some arbitrarily small p > 0. Then 
(a) 



2fci<t<* 

(6.11) 



sup \t'\^\t)-ht)\=O p (^ + ^-]., 
i<t<* \kkf /c 2 log/ci/ 

sup r\^ ) (t)-b u )\=o p (^+-^-)., 

2fci<i<t \fcfc£ fc^log/ciy 
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(b) 



(6.12) 



sup \ t -\^\t)-b 3t ) 

Lfc 1 /2< i <2fc 1 



sup It-^it) 

Lk 1 / 2 <t<2ki 



Hti 



Or 



k 2 log k] 



fc? 



+ 



Am 



+ 



k 2 log fei kk 



kk{ 



where 634 and bu are given by (5.10) and (5.15), respectively, and L > and 
(3>0. 

Proof. We begin with (a). We only examine the first equality in (6.11); 
the proof of the second equality is similarly handled. By definition (see 

Proposition 5.3) £n (t) = &it + bit- First, swp 2kl<t<tQ r "it I satisfies the 
equality in (6.11) by Condition C.5 and since the sum in p has at most 2k\ 
terms, say p* = 1,.. . ,2fei, for which sup p * =lj 2jfel \logg p * +s \ = O p (log _1 fei), 

whereas for the remaining terms sup^—^ |log^p_L. s | — Op(J^\ ) vising 

(5.7) and (5.9). 

Next we estimate b 2 t - b 3t . First by (5.11), sup 2 k 1 <t<t t^lht - &3t| is 



sup t~ 

2ki<t<t 



k 2t 
p=k-t+l p=t+l 



= 0^^^ +t 2 k~ 2 k~ fJ ), 

by Condition C.5 and using (5.7) and (5.9) and Markov's inequality. 

Part (b). As was done in part (a), we only examine the first equality 
in (6.12); the second is similarly handled. By Condition C.5 and (5.7) and (5.9) 
together with Markov's inequality, it follows easily that sup Lk i/2 <t<2kl |i _1 &it| = 
O p (k~ 2 k 2 log -1 k\). Finally, we estimate b 2 t — &3t- As was done in part (a), 
it suffices to examine 



sup t 

£fcV 2 <t<2fci 



Y ^P l °s(9 P +s 
p=k-t+i 



+ 



21 



p=t+l 



The second term of the last displayed expression is O p (kfk 2 log 1 fei) by 

Condition C.5 and using (5.7)-(5.8), whereas the first term is O p (k\k~ 1 k^) 
using Condition C.5 and (5.7) and (5.9), which concludes the proof. □ 



Lemma 6.9. Let d>(u) be as in Lemma 6.6. Then 



sup 

o£[0,l] 



[ku>] 

E 1 



ij+s 



fj 



+s 



£,J+S 



oJl). 
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Proof. Writing Uj = fj+[ 2 LOj+ s ,x and vj = (2ir) 1/2 id j+Sj£ where u)j +s>x 
and ^Oj+ s ,e are t ne discrete Fourier transforms of x r and e r , respectively, the 
left-hand side of the last displayed expression is, by the triangle inequality, 
bounded by 



(6.13) sup 



1 



[ku>] 



ue[o,i] ^ 1/2 ^ =1 



j i +2 sup 

we[o,i] 



1 



fcV2 



[kcj] 



where c denotes the conjugate of the complex number c. 

The first term of (6.13) is o p (l) since its expectation is bounded by 



fc- 1/2 £N{(£| 



1) 



(6.14) 



(E( UjVj ) - 1) - - 1) + (E\ Vj \ 2 - 1)} 

A' 



log J 



i=i 



because -E'l'UjI 2 = 1, [<£i| < D and by the extension of Theorems 1 and 2 
of [31] given in Lemma 4.4 of [12]. 

Next, to show that the second term of (6.13) is o p (l), it suffices to show 
that the finite-dimensional distributions of the term inside the absolute value 
converge to zero and the tightness condition. First, choosing id* such that 
[kid*] = max(p], [fcwi]) for some < ( < 1/4, then for any < u\ < u>2 < 1, 



2E 



[ku 
[fcw 2 ] 

^ 1/2 E ■ 

J=l+[to*] 



Vj)\ is bounded by 



+ 2k~ 1 [kid*]([kidl] - [kuj x ]) 



(6.15) < Dk~ l log 2 fc(([£^ 2 ] 1/2 - [M] 1/2 ) 

x (log(faj2) - logffcwj)) + MCM - [fcwi])), 

proceeding as with (4.8) in [32]. So, the finite-dimensional distributions of the 
second term of (6.13) converge to zero in probability by Markov's inequality. 

To complete the proof we need to show tightness. Since the limiting pro- 
cess has continuous paths, by Billingsley's [4] Theorem 15.6, it suffices to 
show that 



(6.16) E 



j=l+[fcwi] 



>.i r j["j >\i 



<Z)(IT(c^)-H(wi)) 



l+<5 
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where 5 > 0, < uj\ < u>i < 1 and H(uj) is a nondecreasing continuous func- 
tion. The left-hand side of (6.16) is bounded by 

£T 2 (|M 4 | +3M|), 

where M r denotes the rth cumulant of Y^j=\j r [ku) 1 ] ( l ) j v j(.^'j ~ Using the 
inequality in (6.15), hr^M^ < D(H(u>2) — H(uji)) 1+s , so it remains to show 
that A; _2 |M4| satisfies the inequality in (6.16). Now fc _2 |M4| is 

j [fcw 2 ] / 4 \ 

ilJ2J3,i4=l+[fcwi] \i=i / 

where we have abbreviated u.j — u,- by Zj. By Theorem 2.3.2 of [5] and 
denoting Xj\ = <J)jVj and Xj2 = 4>jZj, 

(6.17) = ^cum(A>; j£ 6 #i) • ••cum(X i< ;jl G # p )> 

where the summation is over all indecomposable partitions $ = $i U • • • U 
$ p . A typical component in cum(Xji;j£ G i9i) has gi elements and q<2 
elements Zj, so applying formulae of [5], (2.6.3), page 26, and (2.10.3), 
page 39, we deduce after straightforward calculations that cum(Xji;jl G $i) 

is fljewi <t>j times 

A t gi+<?2 

fc(9i+</2)/2 

^(A 1 + • • • + A^ 1 ) + u 1 + • • • + z/fe)/3(-A x ) . . . ^(-A 91-1 ) 



x /3(-^ 1 )---/3(-z/« 2 ) 

x Efr..^...^ (A 1 ,..., A^- 1 ), i/ 1 , . . . , i/* 2 ) dA 1 • • • dA^ 1 ) dv 1 ■ ■ ■ dv^ , 

where E h ... jqtl ... tp (A 1 , . . . , A^" 1 ' , v\ . . . , iA>) is 

G(A jx - [A 1 H h A^" 1 ) + ^ + • • • + u p ])G(X h + A 1 ) 

x ••• x G(X jq + X^Giu 1 - X h ) x • ■ • x - X lp ), 

with G(A) = £r=i e i4A and, say, /3(-^) = ff^PirV 1 ) - 1. But by a routine 
extension of Lemma 3 of [32] and observing that in each partitioned v, the 
subindex ji, i = 1, . . . , 4, appears only once, 

/ [*wa] 1 \ 4 
(6.17) < DAT 2 2 ^2 < £>(#(u; 2 ) - (c^)) 4 , 

where H{u) =w 1//2 , which is a nondecreasing continuous function. □ 
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Remark 6.1. An alternative proof of this lemma can be found in Lemma 4 



Lemma 6.10 ([5], page 15). Let h(x), < x < 1, be integrable and have 
an integrable derivative h^(x). Then 



7. Conclusions. In this paper we have studied a nonparametric estimator 
for the pole of a long-memory process under mild conditions on the spectral 
density /(A). Specifically, we have only assumed that /(A) ~ C|A — A°| _Q 
with C > 0, but smooth elsewhere, and where a, the memory parameter, 
belongs to the interval (0,1). We have shown that the estimator A of the 
pole A is consistent and we have characterized its limit distribution. More 
precisely, A , centered around A and appropriately renormalized, is asymp- 
totically normal when A € (0,7r), whereas if A = {0,7r}, the asymptotic 
distribution is a mixture of a discrete and continuous random variable. In 
particular, when A = the asymptotic distribution takes the value with 
probability 1/2 and behaves as a (truncated) normal random variable for 
positive values. In addition, we have shown that the asymptotic statistical 
properties of a two-step estimator of a are the same as when A is known. 
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